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36017DK01 Varemeerkestyrelsen 

1 30 DEC. 2003 

Modtaget 

COMPOUNDS USEFUL IN THE DIAGNOSIS AND TREATMENT OF MALARIA 
Field of the Invention 

The present invention relates to the fields of preventing or treating malaria and it provides 
compounds, which are useful within these fields. These compounds may constitute parts of 
5 pharmaceutical compositions and vaccines and be used In methods of treatment, as 
medicaments and for the manufacture of compositions and/or these compounds may 
provide basis for a method of generating a vaccine against malaria. Furthermore, the 
invention relates to the use of these compounds as biotechnologlcal tools and in in vitro 
diagnostic methods and kits. 

10 General background 

Malaria constitutes a permanent catastrophe. Annually, the disease kills between 1 and 2 
million Africans and the economic losses due to malaria constitute a hindrance for 
economic development. In areas of stable malaria transmission the disease mainly affects 
children, because adults have acquired immunity. In these areas Immunity to severe 

15 malaria and protection against malaria deaths is acquired early in life after a few clinical 
infections, whereas the acquisition of immunity that protect Individuals from uncomplicated 
febrile malaria episodes is a much more sluggish process, requiring years of exposure. 
Thus, the epidemiological data indicate that immunity against severe disease Is mediated 
through a different mechanism than immunity against uncomplicated disease, and that the 

20 targets on the parasites for these two kinds of malaria immunity are separate. 

Malaria is caused by unicellular parasites living and multiplying asexually in the red blood 
cells (RBC). In each 48-hour cycle, the parasites invade RBC, multiply within them, and 
eventually burst them, before they go on to invade new RBC, 

25 

Plasmodium falciparum Is the most virulent of the four species causing malaria and 
responsible for most malarial deaths. The particular virulence of P, falciparum Is due to the 
ability of Infected erythrocytes to adhere to a variety of host receptors on the endothelial 
lining such as ICAM-1, VCAM, thrombospondin, ELAM-1, and CD36, and avoid splenic 
30 clearance. Unchecked growth and the accumulation of sequestered parasites in vital 
organs such as the brain are crucial elements in the pathogenesis of severe malaria. 

Sequestration is mediated through parasite-encoded, clonally variant surface antigens 
(VSA) inserted into the membrane of the infected RBC and is thought to be an immune 
35 evasion strategy evolved to avoid splenic clearance. The VSA expressed on infected 
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erythrocytes can be divided into serological types using plasma from individuals living in 
malaria endemic areas as typing reagents. 

By flow cytometry analyses, two main serotypes have been defined, one, VSAsm is mainly 
5 expressed by parasites causing severe and life-threatening P. falciparum malaria, and the 
other, VSAuM is dominantly expressed during uncomplicated malaria infections in semi- 
immune individuals, 

iMaturalfy acquired immunity to malaria is mediated by plasma IgG, which control the 
10 growth of P. falciparum, VSAs are probably the main target of these antibodies, as 

acquisition of protection from P. falciparum malaria corresponds to a gradual accumulation 
of IgG with a broad range of VSA-spedficities, Furthermore, VSA-specific immune 
responses steadily restrict the repertoire of VSA that are compatible with parasite survival, 
and drive VSA expression away from VSAsm towards VSAu„ (Nielsen et aL, 2002). VSA 
15 expression therefore is non-random as it depends on the degree of immunity in the 
infected host. 

The best-characterised VSA are encoded by the var genes. This gene family, encompassing 
about 60 members per genome, encodes a highly polymorphic set of variant proteins, 
20 which collectively have been named P. falciparum erythrocyte membrane protein 1 
(PfEMPl). These proteins are located on the surface of the P. falc/parum-infected 
erythrocytes and have been shown to mediate adhesion to a number of host receptors. 

The var genes contain a large variable 5' exon and a more conserved 3' exon separated by 
25 an intron. The 5' exon normally contains one or two Cysteine-rich Inter-Domain Regions 
(CIDR) and from two to seven domains designated DBL after their similarity to the Duffy 
Binding Ligand of P. vivax (Smith et al. 2000), A given parasite expresses only one PfEMPl 
at a time, but in each generation a fraction of the daughter parasites may switch to 
expression of alternative PfEMPl species through an unknown process. Different PfEMPl 
30 molecules have different receptor specificities, and clonal switching between expression of 
the various var gene products in a mutually exclusive manner allows the parasite to modify 
its adhesion properties (Wahlgren et aL, 1999). 

Thus, the currently available data indicate that severe malaria is caused by parasites 
35 expressing a subset of PfEMPl molecuies and that antibodies directed against these are 
responsible for the protection against severe disease acquired eariy in life by children in 
endemic areas. It fo»ows that the specific identification of the particular PfEMPl molecules 
expressed by the parasites causing severe malaria Is importiant for malaria vaccine 
development. 
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The obvious strategies followed in many laboratories are either to attempt to identify the 
var gene that is dominantly transcribed or to determine which PfEMPl Is expressed by 
parasites isolated from patients suffering from severe malaria. 

5 

Both strategies have been halted because the var genes show extensive intra- and inter- 
genomic variation. 

Attempts to Identify VSAsw-type var gene transcription has been foiled because primer bias 
10 and concomitant transcription of several var genes makes It is impossible to quantify and 
compare transcription of these genes, when all the target nucleic acid sequences are not 
known. The proteomic approach has proved difficult, partly due to the variation in the 
repertoire already mentioned, partly because PfEMPl is expressed at low levels on the 
surface of erythrocytes, and it has been difficult to obtain sufficient amounts of protein for 
15 a reliable mass spectrometry approach. 

Frustrated by these efforts several laboratories has tried to identify the most relevant 
PfEMPl by generating parasite lines with known adhesion phenotype using repeated 
rounds of panning and identifying the dominantly transcribed var genes. This approach has 
20 identified PfEMPl molecules with a known receptor affinity, but since parasites causing 
severe malaria have never been shown to possess a common binding phenotype, the 
relevance of these PfEMPl molecules in the pathogenesis of severe malaria is unclear. 

The PfEMPls constitute a large and polymorphic family. The proteins described in this 
25 application serve a unique function for the parasite In providing high growths rates in non- 
immune individuals and renders these parasites a unique serological phenotype. This 
discovery is of great potential importance because it makes it possible to design strategies 
that will specifically aim at reducing malaria deaths by preventing, diagnosing, and treating 
severe malaria. 

30 

The entire P. falciparum genome of one parasite isolate, 3D7, have previously been data- 
mined using bioinformatic tools. Based on both coding- and non-coding regions var genes 
have been found to group into three major groups (group A, B and C) and two 
intermediate groups B/A and B/C representing transitions between the three major groups. 
35 Group A consists of ten genes consistently Identified as a distinct group by sequence 
analysis CLavstsen et al. 2003). 

Although the proposed grouping of var genes in 3D7 is common knowledge to a person 
skilled in the art: and the sequence of the entire 3D7 P, fyldparum genome is known as 
40 sequence submissions, information on function of SEQ ID 2, SEQ ID NO.: 4, and SEQ ID 
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NO.: 6 in the pathogenesis of malaria and their relevance for a vaccine against severe 
malaria has not previously been described. 

In the 3D7 isolate. Group A contains 10 var genes of which we have identified three var 
5 genes, using a novel method, as being functionally relevant for development of a vaccine 
agasint severe malaria. However, the global repertoire of var genes is unl<now, but 
estimated to encode more than 10^' possible variants PfEMPl. 

Summary of the Invention 

10 In essence, the Inventive concept described herein is based on the observation that three 
genes, PFD1235w/MAL7Pl,l, PF11_0008, and PF13__0003 (referred to as SEQ ID NO.: 1, 
SEQ ID NO,: 3, and SEQ ID NO.: 5) are transcriptionally up-regulated in parasites of the 
species Plasmodium faiciparum, when these parasites have been selected for Increased 
antibody recognition by a novel selection method, described In the present application, 

15 using plasma from semi-immune children. This selection conway parasites a unique 
phenotype characteristic of parasites causing severe malaria. This phenotype Is also 
obtained when parasites are selected for adhesion to a special kind of bone marrow 
derived endothelical cells and also results in the up-regulation of SEQ ID NO.: 1, SEQ ID 
NO.: 3, and SEQ ID NO.: 5. As the cytoadheslon to human endothelial cells and the 

20 antibody recognition of infected RBC is intimately linked to severe malaria, products of 
these genes provide for novel approaches to diagnosing and treating malaria 
prophylactically and/or therapeutically. 

In the broadest sense, the present invention relates to the polypeptides, VAR4, VARS, and 
25 VAR6 (referred to as SEQ ID NO.: 2, SEQ ID NO.: 4, and SEQ ID NO.: 6) encoded by SEQ 
ID NO.: 1, SEQ ID NO.: 3, and SEQ ID NO.: 3, respectively or parts hereof as well as 
polypeptides, which with respect to their sequence are identical in part to such sequences. 
In addition, the invention relates to the SEQ ID NO.: 1, SEQ ID NO.: 3, and SEQ ID NO.: 5 
nucleic acid molecule or parts hereof as well as nucleic acid molecules, which with respect 
30 to their sequence are identical In part to such sequences. 

A presently preferred embodiment relates to medical uses of any of the polypeptides 
and/or nucleic acids according to the present invention as well as methods of treatment of 
malaria comprising molecules of the invention. 

35 

Other aspects of the invention include pharmaceutical compositions and vaccines based on 
the molecules of the invention. In addition, the invention comprises polypeptides or nucleic 
acid molecules of the invention as medicaments, and the use of these polypeptides and 
nucleic acids for the manufacture of compositions, hereunder-immunogenic compositions 
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Which are to administered in order to prophylactically or therapeutically reduce the 
inddence, prevalence or severity of malaria, especially severe malaria. 

It is further within the scope of the present invention to provide a method of treatment 
5 and prevention of malaria, which comprises administering an effective amount of one or 
more of the described molecules of the invention to a subject. 

It will appear that the mentioned polypeptides and nucleic acid molecules will also be 
useful as biotechnological tools. Therefore, the Invention also relates to fn vitro diagnostic 

10 methods, which comprise contacting a sample with polypeptides or nucleic acid molecules 
having the sequences described above, allowing in vitro reactions to occur and 
subsequently detecting any molecular complexes formed. These may for Instance be 
complexes of antigens and antibodies. In some aspects of the invention, the polypeptides 
of the invention are parts of diagnostic kits. Alternatively, these l<its may comprise 

15 antibodies, which specifically recognise such polypeptides. Kits may also compromise, but 
are not limited to oligonucleotides as part of diagnostic kits based on techniques such as 
array, PCR, and real-time quantitative PCR. 

Detailed Description of the Invention 

The new strategy 

20 The present inventors have taken a novel approach and based their strategy on the hereby 
disclosed knowledge of the serological phenotype expressed by parasites causing severe 
malaria (VSAsm) and the availability of all var genes sequences in the parasite line 3D7. 

In culture 3D7 will normally express a typical VSAu„ serotype, thus the present inventore 
25 first developed a method to generate VSAs„ expressing 3D7 lines and then used 
quantitative real time PCR to identify the dominant var gene transcripts. 

They hereafter made specific probes and antibodies, that allowed them to identify the 
PfEMPl molecules expressed on the surface of the erythrocytes responsible for the 
30 serological phenotype carried by the selected parasite line. 

This serological phenotype was also obtained when the 3D7 parasite line was selected for 
adhesion to bone marrow derived endothelial cells. 

35 The relevance of the discovered PfEMPl molecules were further substantiated by the fact 
that parasites expressing the molecules h&ve a high growth rate in non-immune 
individuals, and that var genes with a high similarity to the discovered genes can be 
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identified In other parasites. Tlieir approach is completely different from previous 
strategies of defining variant molecules responsible for binding to different receptors. 

The strategy ts based on both epidemiological studies showing that immunity to severe 
5 malaria is acquired more rapidly than protection from uncomplicated disease and sub- 
clinical Infection findings Indicating that VSAsm constitute a restricted and anttgenlcaify 
conserved VSA subset, whereas VSAum are more diverse. 

Together, these observations enabled developing of the disease-ameliorating strategies 
10 presented in the present application, which can protect against mortality and severe 
morbidity by accelerating acquisition of immunity to VSAsM-expressIng parasites and 
thereby forcing VSA expression away from VSAsm. 

As highlighted earlier Piasmodium falciparum erythrocyte membrane protein-1 (PfEMPl), 
15 are a highly polymorphic and diverse family of proteins. Every parasite genome carries 
about 60 genes encoding PfEMPl and the repertoire of PfEMPl genes differ from parasite 
genome to parasite genome. Thus, PfEMPl genes show both intra- and inter-genomic 
variation, and the global repertoire of PfEMPl proteins Is unknown, but estimated to be 
very large, in fact the present inventors have estimated that there are more 10^^ possible 
20 variants of this protein. The common features shared by the PfEMPl family of genes and 
proteins are the organization of the genes (two exons and an intron), and the presence of 
domain structures that can be classified as Duffy Binding Ligand-lil<e (DBL) or Cysteine- 
Rich Interdomain Region (CIDR). 

25 In addition, the proteins share a relative conserved C-terminal tall consisting of a trans- 
membrane region and a relatively short intracellular domain. However, it must be stressed 
that the genes and the encoded proteins vary considerably between each other; both with 
regards to sequence (primary structure) and organization of the domains (Lavstsen et aL, 
2003). 

30 

It is also clear that expression of different PfEMPl molecules confer parasite different 
functional (Smith et aL, 2000; Robinson et aL, 2003) and antigenic characteristics (Salanti 
et aL, 2003). Within PfEMPl domains classified as belonging to the same group and 
subgroup (I.e. DBLa, DBLp, CIDRy etc) short identity blocks of 2-14 amino acids can be 
35 identified between hyper variable blocks of varying lengths (of up to several hundred 
amino acids) m which there Is no or very little homology between randomly chosen 
PfEMPl. Thus, PfEMPl molecules constitute a very large and diverse family of proteins. 
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Inventive concept 

The inventive concept disclosed in the present application Is based on the unexpected 
observation that the transcription of 3 specific Plasmodium falciparum var genes, SEQ ID 
NO.: 1, SEQ ID NO.: 3, and SEQ ID NO.: 5, all members of Group A var genes, is up- 
5 regulated in parasite lines and clones carrying the VSAsm phenotype. This up-regulation 
followed both after antibody-selectjon and selection for adhesion of infected RBC to bone 
marrow derived endotheiial cells in vitro. 

These observations shows that the proteins encoded by these three specific var genes are 
10 responsible for inducing the first IgG with specificity to PfEMPl on iRBC and thus the 
dominant PfEMPl expressed during R falciparum Infections in immunologically naive 
individuals. Such proteins are useful as therapeutic and prophylactic agents as well as 
biological tools and diagnostic agents for the study, treatment and prevention of malaria, 
since these proteins described serve a unique function for the parasite in providing high 
15 growths rates in non-immune individuals and renders these parasites a unique serological 
phenotype. This is of great potential importance because it makes It possible to design 
strategies that will specifically aim at reducing malaria deaths by preventing, diagnosing, 
and treating malaria. 

Polypeptide molecules of the invention 
20 Thus, In its broadest aspect, the present invention relates to these 3 isolated polypeptides 
comprising at least one amino acid sequence selected from the group consisting of at least 
one of 

a) SEQ ID NO. : 2, SEQ ID NO. : 4 or SEQ ID NO. : 6, and 
2^ t>) a sequence having at least 80% sequence identity to a), and 

c) sub-sequences of a) or b) with a minimum length of 10 amino acids, 
and 

d) sub-sequences of a) b) comprising at least one B-cell epitope; 

30 

With the proviso that the Exon 2 of SEQ ID NO.: 2, SEQ ID NO.: 4 and/or SEQ ID NO.: 6 is 
excluded. 

With the proviso that the "Fragment 1" and/or ^Tragment 2" and/or "Fragment 3" and/or 
35 '^Fragment 4" and /or "Fragment S" and/or ^Tragment 6" and/or "Fragment 7" and/or 
''Fragment 8" and/or 'Tragment 9" and/or ^Fragment 10" and/or ^'Fragment 11" and/or 
''Fragment 12" and/or ^Tragment 13" and/or ^'Fragment 14" and/or ^'Fragment 15" and/or 
"Fragment 16" and/or "Fragment 17" "Fragment 18" and/or "Fragment 19" and/or 
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"Fragment 20" and/or "Fragment 21" and/or ^Tragment 22" and/or "Fragment 23" and/or 
"Fragment 24" and/or "Fragment 25" and/or "Fragment 26" of SEQ ID NO.: 2, SEQ ID 
NO.: 4, and/or SEQ ID NO,: 6 is excluded. 

5 With the proviso that the "Fragment 27" and/or "Fragment 28" and/or "Fragment 29" 
and/or "Fragment 30" and/or of SEQ ID NO.; 2 and/or SEQ ID NO.: 6 is excluded. 

With the proviso that the "Fragment 31" and/or "Fragment 32" of SEQ ID NO.: 2 is 
excluded. 

10 

With the proviso that the amino acid sequence of SEQ ID NO.: 2, SEQ ID NO.: 4 and/or 
SEQ ID NO.: 6 is excluded. 

For the present and any of the following aspects of the Invention it applies that it is an 
15 object of preferred embodiments of the present Invention to provide polypeptides, which 
are subject to antibody recognition by antibodies in sera from young children living in 
areas of high malaria transmission intensity and/or capable of mediating cyto-adhesion of 
intact erythrocyte infected by a parasite to endotheMal cells, but not the CD36 receptor. 
The dtstinction to CD36 adhesion can be evaluated by the examples of the present 
20 Invention, e.g. as described in example 4. 

Length of the molecules 

For all the aspects of the invention, it is apparent that the polypeptides of the mvention, 
which form the basis of the described embodiments of the invention may be less or equal 
25 to any length between 6 - 3552 (SEQ ID NO.: 2), 6 - 2992 (SEQ ID NO.: 4) and 6 - 3344 
{SEQ ID NO.: 6) amino acids, since these polypeptide alt have great immunogenic 
capabilities and thus are especially great as candidates for the development for the 
treatment of malaria, particularly in the development of a vaccine. 

30 Thus the present invention relates to any sub-sequence originating from any of the 

polypeptides having an amino acid length such as but not limited to less than or equai to 
6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 
30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 
53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 

35 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 
99, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 225, 250, 275, 300, 350, 
400, 450, 500, 600, 700, 800, 900, 1000, 1250 , 1500, 1750, 2000, 2250, 2500, 2750, or 
2990 amino acids In length. In a further embodiment the length of a sub-sequence of SEQ 
ID NO.: 2 can be less than or equal to 3000, 3100, 3200, 3300, 3400, 3500 or 3550 amino 
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acids in length, whereas the length of a sub-sequence of SEQ ID NO.: 6 furthermore can 
be less than or equal to 3000, 3100, 3200, 3300 or 3340 amino acids in length. 

In addition to these sequences, fragments or sub-sequences of the polypeptide of the 
5 Invention, larger protetns/polypeptides comprising such sub-sequences as part of their 
sequence, are aiso embodiments of the present invention, thus with respect to all aspects 
of the invention it may be preferred that the polypeptides of the invention may have a 
length above the disclosed full length of 3553 (SEQ ID NO,: 2), 2994 (SEQ ID NO,: 4) or 
3346 (SEQ ID NO.: 6) amino acids, if combined with other amino acids for e.g. preparation 
10 purposes. 

Some characteristic lie within the polypeptide sequence of SEQ ID NO.: 2, SEQ ID NO.: 4, 
and SEQ ID NO.: 6 such as B- and T-cell epitopes. 

T-cell epitopes were defined using the SYFPEITHI server at Centre for Biological Sequence 
15 Analysis BloCentrum-DTU at the Technical University of Denmark ( h ttp : //syf peith i . bmi~ 
heidetbera.com/Scripts/MHCServer.dll/ EpPredict.htm ) with default settings. B-cell 
epitopes were defined using the Protean 4.0 software rn the DNAstar package Iwith default 
settings. 

For SEQ ID NO.: 2 predicted T-cell epitopes are defined as, but not limited to amino acid 
20 no. 13-45, 93-107, 135-167, 201--236, 247-261, 302-316, 327-348, 394-409, 432-446, 
457-473, 532-546, 573-589, 595-609, 625-639, 678-692, 870-890, 942-956, 986-1000, 
1097-1111, 1130-1144, 1151-1165, 1196-1210, 1473-1492, 1508-1548, 1593-1630, 
1633-1647, 1800-1814, 1830-1844, 1923-1937, 1943-1957, 1997-2025, 2235-2249, 
2322-2336, 2378-2404, 2427-2441, 2464-2478, 2507-2521, 2533-2553, 2608- 
25 2637,26662-2683, 2693-2720, 2794-2808, 2851-2866, 2879-2904, 2965-2979, 3074- 
3088, 3092-3149, 3201-3216, 3236-3250, 3274-3288, 3297-3311, 3322-3336, 3350- 
3364, 3375-3395, 3488-3502, 3530-3544. 

For SEQ ID NO.: 2 predicted B-cell epitopes are defined as, but not limited to amino acid 
30 no. 197-281, 365-470, 526-575, 631-772, 820-891, 905-933, 961-1003, 1024-1066, 
1073-1188, 1206-1248, 1269-1367, 1402-1437, 1535-1731, 1991-2026, 2054-2096, 
2124-2229, 2278-2313, 2404-2460, 2488-2530, 2663-2713, 2832-2965, 3014-3112, 
3154-3210, 3322-3406. 

35 For SEQ ID NO.: 4 predicted T-cell epitopes are defined as, but not limited to amino acid 
no. 117-131, 154-168, 177-213, 224-238, 258-272, 307-321, 332-346, 419-434, 454- 
468, 512-534, 561-595, 619-641, 645-680, 691-710, 781-795, 983-997, 1043-1057, 
1065-1093, 1143-1185, 1296-1310, 1383-1397, 1453-1467, 1567-1597, 1653-1667, 
1693-1707, 1740-1754, 1774-1836, 1855-1873, 1929-1943, 2066-2080, 2108-2122, 
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2130-2156, 2174-2188, 2206-2220, 2282-2296, 2309-2323, 2439-2453, 2552-2566, 
2578-2592, 2625-2639, 2644-2673, 2697-2714, 2719-2733, 2744-2774, 2802-2816, 
2918-2943, 2971-2985. 

5 For SEQ ID NO,: 4 predicted B-cell epitopes are defined as, but not limited to amino acid 
no. 30-65, 89-113, 125-142, 213-272, 390-432, 491-514, 733-792, 833-851, 874-904, 
1040-1069, 1128-1170, 1187-1211, 1258-1276, 1306-1365, 1394-1406, 1441-1499, 
1542-1571, 1648-1678, 1884-1908, 1949-1979, 2268-2309, 2345.2369, 2392-2416, 
2853-2906. 

10 

For SEQ ID NO.: 6 predicted T-cell epitopes are defined as, but not limited to amino acid 
no. 14-28, 43-57, 71-85, 222-248, 251-265, 301-315, 318-332, 359-373, 416-443, 446- 
460, 471-485, 518-532, 579-593, 624-643, 783-797, 949-967, 986-1000, 1016-1030, 
1075-1093, 1141-1155, 1265-1279, 1340-1372, 1440-1471, 1505-1515, 1529-1556, 
15 1634-1648, 1659-1673, 1687-1701, 1712-1726, 1813-1827, 1841-1855, 1875-1995, 
2092-2110, 2137-2151, 2161-2171, 2249-2263, 2319-2333, 2379-2393, 2475-2495, 
2502-2516, 2549-2564, 2591-2605, 2629-2643, 2650-2669, 2674-2688, 2734-2766, 
2916-2930, 2989-3018, 3038-3052, 3115-3129, 3131-3145, 3167-3187, 3248-3262, 
3323-3337. 

20 

For SEQ ID NO.: 6 predicted B-cell epitopes are defined as, but not limited to amino acid 
no. 7-27, 383-429, 627-786, 898-990, 1089-1221, 1307-1340, 1373-1525, 1670-1815, 
1901-1934, 2007-2027, 2225-2251, 2363-2409, 2495-2528, 2614-2660, 2706-2911, 
3221-3300. 

25 

Some characteristic structures lie within the polypeptide sequence of SEQ ID NO.: 2, SEQ 
ID NO.: 4, and SEQ ID NO.: 6 and therefore also within the nucleotide sequence encoding 
these potypeptide sequences. Such structures comprise, but are not necessarily limited to 
DBL2, DBL5, and C2. 

30 

Preferred embodiments of the present invention include specific sub-sequences of the 
polypeptide of the invention having a minimum length of 10 amino acids such as sub- 
sequences that are at least 100 amino acids long. In even more preferred embodiments of 
the invention, these sub-sequences can be shown by known molecular biological 
35 techniques to be involved in the interaction with endothelial receptors and/or to be 
recognised by antibodies using plasma from young semi-immune children. 

It is anticipated that relatively short sequences within the SEQ ID NO.: 2, SEQ ID NO.: 4 
and/or SEQ ID NO.: 6 are responsible for mediating adhesion to endothelial receptors 
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other than CD36. In particular, it is possible that certain CIDR or DBL domains or parts 
hereof are responsible for the adhesion and/or for antibody recognition. 

In other preferred embodiments of the Invention, the sequences of the polypeptide of the 
5 invention can be shown to possess one or more antigen epitopes. In particular, such 
epitopes may be B-cell epitopes. Optionally, the sub-sequences may also comprise one or 
more T-cell epitopes alone or in combination with the B-cell epitopes. Finally, also larger 
polypeptides comprising the polypeptide of the Invention or sub-sequences hereof with 
antigen epitopes and/or sequences involved in interaction with endothelial receptors are 
10 embodiments of the present invention. 

It Is also apparent that the polypeptide sequences of the invention can be present In the 
form of fusion proteins. In a further preferred embodiment, this fusion protein will 
comprise polypeptide sequences, which will facilitate the purification or detection of the 
15 protein. These polypeptide sequences may be but are not limited to tags that will facilitate 
purification and detection using commercially available systems such as the HA- ,-c-myc. 
His or GST tags. 

The polypeptide embodiments of the present invention can therefore exhibit a vast degree 
20 of sequence identity to the full-length of SEQ ID NO.: 2, SEQ ID NO.: 4 and/or SEQ ID 

NO.: 6. It can for instance be appreciated that a fusion protein carrying within its sequence 
one or more B-cell epitopes and or regions of the polypeptide of the invention that are 
involved in adhesion to endothelial receptors will have a relatively low overall degree of 
sequence identity to full-length SEQ ID NO.: 2, SEQ ID NO.: 4 and/or SEQ ID NO.: 6. 

25 

Sequence identity of the polypeptide molecules 

For all the aspects of the invention. It is thus apparent that the polypeptides of the 
invention may include sequences, which show anywhere between 40-100% sequence 
Identity, such as at least 41%, at least 42%, at least 43%, at least 44%, at least 45%, at 

30 least 46%, at least 47%, at least 48%, at least 49% at least 50%, at least 55%, at least 
60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, 
at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at 
least 89%, at least 90%at least 91%, at least 92%, at least 93%, at least 94%, at least 
95%, at least 96%, at least 97%, at least 98%, at least 99 % or preferably 100% 

35 sequence Identity to SEQ ID NO.: 2, SEQ ID NO.: 4 and/or SEQ ID NO.: 6 or a fragment or 
sub-sequence thereof. 
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As understood by the skFlled adressee, the sequence identity of the polypeptide molecules 
of the present invention will be higher the smaller the fragments or sub-sequences in order 
to be unique, thus in some embodiments of the present Invention^ the fragments or sub- 
sequences are of 20-50 amino acid long and has a sequence identity of 80-100%, such as 
5 20-50 amino acid long, 20-40 amino acid long, 20-30 amino acid long, 20-25 amino acid 
long, 30-50 amino acid long, 30-40 amino acid long, 30-35 amino acid long, 40-50 amino 
acid long, 40-45 or 45-50 amino acid long ail with a sequence Identity of 80-100%, such 
as 80% identical, 81% identical, 82% identical, 83% identical, 84% identical, 85% 
identical, 86% identical, 87% identical, 88% Identical, 89% Identical, 90% identical, 91% 
10 identical, 92% identical, 93% identical, 94% identical, 95% identical, 96% identical, 97% 
identical, 98% identical, 99% identical or even 100% identical. 

In other embodiments of the present invention, the fragments or sub-sequences are of 51- 
750 amino acid long and has a sequence identity of 70-100%, such as 51-750 amino acid 

15 long, 60-700 amino acid long, 70-600 amino acid long, 80-500 amino acid long, 90-400 
amino acid long, 100-250 ammo add long, 300-350 amino acid long, 400-500 amino acid 
long, 200-650 or 450-500 amino acid long all with a sequence identity of 70-100%, such 
as 70% identical, 71% identical, 72% identical, 73% identical, 74% identical, 75% 
identical, 80% identical, 85% identical, 88% identical, 89% identical, 90% identical, 91% 

20 identical, 92% identical, 93% identical, 94% identical, 95% identical, 96% identical, 97% 
identical, 98% identical, 99% identical or even 100% identical. 

In yet other embodiments of the present Invention, the fragments or sub-sequences are of 
more than 751 amino acid long and has a sequence Identity of 60%- 100%, such as more 

25 751 amino acid long, more 800 amino acid long, more than 900 amino acid long, more 
than 1000 amino acid long, more than 1100 amino acid long, more than 1200 amino acid 
long, more than 1300 amino acid long, more than 1400 amino acid long, more than 2000 
amino acids long or 2500 amino acid long all with a sequence identity of 60-100%, such as 
60% identical, 61% identical, 62% Identical, 63% Identical, 64% identical, 65% identical, 

30 70% identical, 75% identical, 77% Identical, 80% Identical, 85% identical, 88% Identical, 
92% identical, 93% Identical, 94% identical, 95% identical, 96% identical, 97% identical, 
98% Identical, 99% identical or even 100% identical. 

Additionally, variants are also an embodiment of the present invention and are determined 
35 based on a predetermined number of conservative amino acid substitutions as defined 
herein below. Conservative amino acid substitution as used herein relates to the 
substitution of one amino acid (within a predetermined group of amino acids) for another 
amino acid (within the same group), wherein the amino acids exhibit similar or 
substantially similar characteristics. 
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Within tlie meaning of the term "conservative amino acid substitution" as applied herein, 
one amino acid may be substituted for another within the groups of amino adds indicated 
herein below: 

5 

Amino adds having polar side chains (Asp, Glu, Lys, Arg, His, Asn, Gin, Ser, Thr, Tyr, and 

Cys,) 

Amino acids having non-polar side chains (Gly, Ala, Val, Leu, He, Phe, Trp, Pro, and Met) 

Amino acids having aliphatic side chains (Gly, Ala Val, Leu, He) 
10 Ammo acids having cyclic side chains (Phe, Tyr, Trp, His, Pro) 

Amino acids having aromatic side chains (Phe, Tyr, Trp) 

Amino acids having acidic side chains (Asp, Glu) 

Amino adds having basic side chains (Lys, Arg, His) 

Amino acids having amide side chains (Asn, Gin) 
15 Amino adds having hydroxy side chains (Ser, Thr) 

Amino acids having sulphor-containing side chains (Cys, Met), 

Neutral, weakly hydrophobic amino acids (Pro, Ala, Gly, Ser, Thr) 

Hydrophilic, addle amino adds (Gin, Asn, Glu, Asp), and 

Hydrophobic amino acids (Leu, He, Val) 

20 

Preferred conservative amino acids substitution groups are: valine-leucine-isoteucine, 
phenylalanine-tyrosine, lysine-arginine, alanine-valine, and asparagine-glutamine. 

Accordingly, a variant or a fragment thereof according to the invention may comprise, 
25 within the same variant of the sequence or fragments thereof, or among different variants 
of the sequence or fragments thereof, at least one substitution, such as a plurality of 
substitutions introduced independently of one another. 

It is dear from the above outline that the same variant or fragment thereof may comprise 
30 more than one conservative amino acid substitution from more than one group of 
conservative amino acids as defined herein above, 

The present invention further relates to addition or deletion of at least one amino acid in 
relation to any of the amino acid sequences according to the invention and may be an 
35 addition or deletion of from preferably 2 to 50 amino adds, such as from 10 to 20 amino 
acids, for example from 20 to 30 amino acids, such as from 40 to 50 amino acids. 

However, additions or deletions of more than 50 amino acids, such as additions from 50 to 
100 amino acids, addition of 100 to 150 amino adds, addition of 150-250 amino adds, are 
40 also comprised within the present invention. Even additions or deletions of 500-2000 

amino acids are within the scope of the present invention. The deletion and/or the addition 
may - Independently of one another - be a deletion and/or an addition within a sequence 
and/or at the end of a sequence. 
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The polypeptide fragments according to the present invention, including any functional 
equivalents thereof, may in one embodiment comprise less than 250 amino acid residues, 
such as less than 240 amino acid residues, for example less than 225 amino acid residues, 
such as less than 200 amino acid residues, for example less than 180 amino acid residues, 
5 such as less than 160 amino acid residues, for example less than 150 amino acid residues, 
such as less than 140 amino acid residues, for example less than 130 amino acid residues, 
such as less than 120 ammo acid residues, for example less than 110 amino acid residues, 
such as less than 100 amino acrd residues, for example less than 90 amino acid residues, 
such as iess than 85 amino acid residues, for example less than 80 amino acid residues, 
10 such as less than 75 amino acid residues, for example less than 70 amino acid residues, 
such as less than 65 amino acid residues, for example less than 60 amino acid residues, 
such as less than 55 amino acid residues, for example less than 50 amino acid residues. 

'Tunctlonal equivalency"^ as used in the present invention Is according to one preferred 
15 embodiment established by means of reference to the corresponding functionality of a 
predetermined fragment of the sequence. 

It is understood that the polypeptide fragments of the invention may possess one or more 
types of post-translational modifications when expressed on the cell surface. These 
20 modifications may comprise, but are not limited to, glycosylation, phosphorylation, 
acylation, cross-linking, proteolytic cleavage, linkage to an antibody molecule, a 
membrane molecule, or another ligand. 

It is an object of presently preferred embodiments of the present invention that the 
25 polypeptide comprises the amino acid sequence shown in SEQ ID NO.: 2, SEQ ID NO.: 
4or SEQ ID NO.: 6. 

It is an object of presently most preferred embodiments of the present invention that 
the polypeptide consists of the amino acid sequence shown rn SEQ ID NO.: 2, SEQ ID 
30 NO.: 4 or SEQ ID NO.: 6. 

Nucleic acid molecules 

The embodiments of the present invention thus relate to polypeptides of the PfEMPl class 
or sub-sequences hereof as well as nucleic acid molecules encoding such polypeptides or 
35 sub-sequences, wherein said polypeptides and sub-sequences comprise structures that are 
involved directly or Indirectly in the binding to endothelial receptors and/or recognised by 
plasma antibodies from young semi-immune children. 

The PF01235W/MAL7R1.1, PF11_0008, and PF13_0003 genes presented in SEQ ID NO.: 1, 
40 SEQ ID NO.: 3, and/or SEQ ID NO.: 5 are members of Group A var genes and, in their 

widest perspective, the embodiments of the Invention thus relate to nucleic acid molecules, 
which are characteristic In that they do not belong to the varl, var2 gene subfamily as 
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defined in Salanti et al. 2002 and Salanti et al. 2003 or Group B, Group and Group B/C, 
but does not exclude Group B/A var genes as defined in Lavstsen et al. 2003. 

Furthermore, nucleic acid molecules, which are complementary to the nucleic acid 
5 molecules of the invention as described above as well as polypeptides encoded by these 
nucleic acid molecules are within the scope of the invention. 

One embodiment of the present mventlon relates to a nucleic acid molecule comprising at 
least one nucleotide sequence selected from the group consisting of at least one of 

10 

a) SEQ IN NO.: 1, SEQ ID NO.: 3 and SEQ ID NO.: 5 or a sequence 
complementary thereof; and 

b) a nucleotide sequence having at least 80% sequence identity to a); and 

c) sub-sequences of a) or b) with a minimum length of 30 nucloetides; and 
i5 d) sub-sequences of a) or b) which comprises at least one sequence 

encoding a B-cell epitope; 

As understood by the skilled adressee, the sequence identity of the nucleic acid molecules 
of the present invention will be higher the smaller the fragments or sub-sequences in order 

20 to be unique, thus in some embodiments of the present invention, the fragments or sub- 
sequences are of 60-150 nucleic acid long and has a sequence identity of 80-100%, such 
as 60-140 nucleic acid long, 60-130 nucleic acid long, 60-120 nucleic acid long, 60-110 
nucleic acid long, 75-150 nucleic acid long, 80-140 nucleic acid long, 90-135 nucleic acid 
long, 90-150 nucleic add long, 120-130 or 125-150 nucleic acid long all with a sequence 

25 identity of 80-100%, such as 80% identical, 81% identical, 82% identical, 83% identical, 
84% identical, 85% identical, 86% identical, 87% identical, 88% identical, 89% identical, 
90% identical, 91% Identical, 92% Identical, 93% Identical, 94% Identical, 95% Identical, 
96% identical, 97% identical, 98% identical, 99% identical or even 100% identical. 

30 In other embodiments of the present invention, the fragments or sub-sequences are of 
151-2200 nucleic acid long and has a sequence identity of 70-100%, such as 151-2100 
nucleic acid long, 200-1700 nucleic acid long, 300-1600 nucleic acid long, 400-1500 
nucleic acid long, 500-1400 nucleic acid long, 1000-1250 nucleic acid long, 1300-1350 
nucleic acid long, 1400-1500 nucleic acid long, 200-1650 or 450-2000 nucleic acid long all 

35 with a sequence identity of 70-100%, such as 70% identical, 71% identical, 72% identical, 
73% identical, 74% Identical, 75% identical, 80% Identical, 85% identical, 88% identical, 
89% identical, 90% identical, 91% identical, 92% identical, 93% identical, 94% identical, 
95% identical, 96% identical, 97% identical, 98% identical, 99% Identical or even 100% 
identical. 



40 
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In yet other embodiments of the present Invention, the fragments or sub-sequences are of 
more than 2201 nucleic acid long and has a sequence Identity of 60%- 100%, such as more 
2201 nucleic acid long, more 2300 nucleic acid long, more than 2400 nucleic acid long, 
more than 3500 nucleic acid long, more than 4000 nucleic acid long, more than 5000 
5 nucleic acid long, more than 5500 nucleic add long, more than 6000 nucleic acid long, 
more than 7000 nucleic acids long or 8500 nucleic acid long all with a sequence identity of 
60-100%, such as 60% identical, 61% identical, 62% Identical, 63% identical, 64% 
identical, 65% Identical, 70% identical, 75% identical, 77% identical, 80% identical, 85% 
identical, 88% identical, 92% identical, 93% identical, 94% identical, 95% identical, 96% 
10 identical, 97% identical, 98% identical, 99% identical or even 100% IdenticaK 

With the proviso that the Exon 2 of SEQ ID NO. : 1, 3 and/or 5 is excluded 

With the proviso that the "Fragment 1" and/or ''Fragment 2" and/or ''Fragment 3" and/or 
15 "Fragment 4" and /or "Fragment 5" and/or "Fragment 6" and/or "Fragment 7" and/or 
''Fragment 8" and/or "Fragment 9" and/or "Fragment 10" and/or "Fragment 11" and/or 
"Fragment 12" and/or "Fragment 13" and/or "Fragment 14" and/or "Fragment 15" and/or 
"Fragment 16" and/or "Fragment 17" "Fragment 18" and/or "Fragment 19" and/or 
"Fragment 20" and/or "Fragment 21" and/or "Fragment 22" and/or "Fragment 23" and/or 
20 "Fragment 24" and/or "Fragment 25" and/or "Fragment 26" of SEQ ID NO.: 1, SEQ ID 
NO.: 3, and/or SEQ ID NO.: 5 is excluded. 

With the proviso that the "Fragment 27" and/or "Fragment 28" and/or "Fragment 29" 
and/or "Fragment 30" and/or of SEQ ID NO.: 1 and/or SEQ ID NO.: 5 is excluded. 

25 

With the proviso that the "Fragment 31" and/or "Fragment 32" of SEQ ID NO.: 1 is 
excluded. 

With the proviso that the nucleotide sequence of SEQ ID NO.: 1, SEQ ID NO.: 3 and/or 
30 SEQ ID NO.: 5 is excluded. 

The cDNA sequence encoding SEQ ID NO. 2, SEQ ID NO.: 4, and /or SEQ ID NO.: 6 In the 
parasite line 3D7 Is provided in the sequence listing as SEQ ID NO.: 1, SEQ ID NO-: 3 
and/or SEQ ID NO.: 5. 

35 

Again, it is apparent for ail the aspects of the Invention that the nucleic acid molecufes of 
the invention may be less than or equal to any length between 30-10662 (SEQ ID NO.: 1), 
30-8985 (SEQ ID NO.: 3) or 30-10041 (SEQ ID NO.: 5) nucleotides, such as less than or 
equal to 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 
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51, 52, S3, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 
74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 
97, 98, 99, 100 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 225, 250, 275, 300, 
350, 400, 450, 500, 600, 700, 800, 900, 1000, 1250, 1500, 1750, 2000, 2500, 3000, 
5 3500, 4000, 4500, 5000, 6000, 7000, 8000, , 8100, 8200, 8300, 8400, 8500, 8600, 8700, 
8800, 8900, 8955, 8956, 8957, 8958, 8959, 8960, 8961, 8962, 8963, 8964, 8965, 8966, 
8967, 8968, 8969, 8970, 8971, 8972, 8973, 8974, 8975, 8976, 8977, 8978, 8979, 8980, 
8981, 8982, 8983, 8984, 8985 nucleotides in length for SEQ ID NO.: 3 and further 9000, 
9100, 9200, 9300, 9400, 9500, 9600, 9700, 9800, 9900, 10000, 10011, 10021, 10031, 
10 10041 nucleotides In length for SEQ ID NO.: 5, and furthermore 1005O, 10100, 10200, 
10300, 10400, 10500, 10600, 10650, 10651, 10652, 10653, 10654, 10655, 10656, 
10657, 10658, 10659, or 10660 nucleotides in length for SEQ ID NO.: 1. 

Still with respect to all aspects of the invention It may be preferred that the nucleic acid 
15 molecules of the invention may have a length of 30 - 40, 30 - 50, 30 - 60, 30 - 70, 30 - 
80, 30 - 90, 30 - 100, 30 - 200, 30 - 400, 30 - 500, 30 - 1000, 30 - 1200, 30 - 1300, 30 - 
1400, 30 - 1500, 30 - 1600, 30 - 1700, 30 - 1800, 30 - 1900, 30 - 2000, 30 - 2250, 30 - 
2500, 30 - 2750, 30 - 3000, 30 - 3500, 30 - 4000, 30 - 4500, 30 - 5000, 30 - 6000, 30 - 
7000, 30 - 8000, 30 - 9000 nucleotides. 

20 

In preferred embodiments of the invention, sub-sequences of the nucleic acid molecules of 
the invention have a minimum length of 30 nucleotides and in even more preferred 
embodiments these sub-sequences are at least 300 nucleotides long. 

Sequence identity of the nucleic acid molecules 

25 

Preferred nucleic acid embodiments further Include nucleic acids encoding fragments of the 
polypeptide of the invention that are involved in interaction with endothelial receptors. In 
addition. It is an object of preferred embodiments that sub-sequences of the nucleic acid 
molecule of the invention comprise nucleotides encoding one or more B-cell epitopes 
30 and/or one or more T-cell epitopes. 

Some characteristic structures lie within the peptide sequence of SEQ ID NO,: 2, SEQ ID 
NO.: 4 and/or SEQ ID NO.: 6 and therefore also within the nucleotide sequence encoding 
this peptide sequence. Such structures comprise, but are not necessarily limited to, a 
35 DBL2P domain followed by a C2 domain. On the other hand, some common features have 
been identified for proteins encoded by Group B, Group C, and Group B/C var genes 
including, but not limited to the DBL25 domain. These features are not found within the 
amino acid sequence of SEQ ID NO.: 2, SEQ ID NO.: 4 and/or SEQ ID NO.: 6. 
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Further embodiments comprise nucleic acid molecules that complement full-length of SEQ 
ID NO-: 1, SEQ ID NO.: 3 and/or SEQ ID NO.: 5 or sequences identical in part hereto as 
well as nucleotide sequences that complement fragments of full-length SEQ ID NO.: 1, 
5 SEQ ID NO.: 3 and/or SEQ ID NO.: 5, or sequences identical in part hereto. 

Preferred complementary nucleic acid molecules of the invention comprise nucleic acid 
molecules that are complementary to fragments of SEQ ID NO,: 1, SEQ ID NO,: 3 and/or 
SEQ ID NO.: 5, which have a nucleotide sequence that encodes a polypeptide or parts of a 
10 polypeptide that are involved in interaction with endothelial receptors. Additionally^ 

preferred complementary nucleic acid molecules of the invention are complementary to 
sequences encoding one or more B-cell epitopes and/or one or more T-cell epitopes. 

As discussed for the polypeptide-based compounds of the invention it is also apparent that 
15 the nucleotide based embodiments may represent only part of the full-length sequence. In 
addition these nucleotide sequences may be present in combination with exogenous 
sequences. For all the aspects of the invention, it is thus apparent that the nucleic acids 
molecules of the invention may include sequences that have anywhere between 1-100% 
sequence identity to the full-length sequence of SEQ ID NO.: 1, SEQ ID NO.: 3 and/or SEQ 
20 ID NO.: 5, such as at least 5%, at least 10%, at least 15%, at least 20%, at least 25%, at 
least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at feast 
60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, 
at least 95%, at least 97% or, preferably, 100% sequence identity to SEQ ID NO.: 1, SEQ 
ID NO.: 3 and/or SEQ ID NO.: 5, or a fragment or sub-sequence thereof. 

25 

It is to be understood that the nucleotide sequence of SEQ ID NO.: 1, SEQ ID NO.: 3 
and/or SEQ ID NO.: 5, when present within the genome of the intact Plasmodium 
fafciparum parasites as well as the polypeptide sequence of SEQ ID NO.: 2, SEQ ID NO.: 4 
and/or SEQ ID NO.: 6, when present in or on the surface of intact red blood cells Infected 
30 with P. falciparum are excluded from the scope of the present invention. This applies to all 
embodiments of the invention described in the present application. Compounds of the 
invention may however comprise sub-sequences of SEQ ID NO.: 1, SEQ ID NO.: 3 and/or 
SEQ ID NO.: 5 and sub-sequences of SEQ ID NO.: 2, SEQ ID NO.: 4 and/or SEQ ID NO.: 6 
Isolated and/or purified from the Plasmodium parasites or infected RBC. 

35 Vectors 

In addition, recombinant polypeptides comprising sub-sequences of the amino acid 
sequence of SEQ ID NO.: 2, SEQ ID NO.: 4 and/or SEQ ID NO.: 6 may be generated by 
use of the above-mentioned nucleic acid embodiments. These can be cloned into vectors 
by the use of cloning techniques known In the art. The sequence encoding the polypeptide 
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of interest Is thereby linked to a heterologous promoter sequence. It may be preferred to 
optimise the codon context and codon pairing for the particular expression system. With 
respect to the polypeptide embodiments of the invention the incorporation of a secretory 
leader sequence may also be of use. The vector can be an expression vector in any of the 
5 mammalian, yeast, amphibian, insect, parasite, or bacterial expression systems known In 
the art. It is therefore apparent that, with the exception of Plasmodium infected RBC, 
prokaryotic and eukaryotic cells hereunder mammalian cells and transformed cell lines as 
well as cells In animals possessing nucleotide and/or amino acid embodiments described 
herein, are within the scope of the present invention. 

10 

Cells and cell lines 

Propagation of such cells or cell lines may be performed with the intention of providing 
recombinant forms of one or more of the nucleic acid or polypeptide embodiments of the 
invention In amounts that are sufficient for further processing or purification. It is therefore 

15 within the scope of the present Invention to provide preparations of compounds, which 
comprise polypeptides of the invention as well as nucleic acid molecules encoding these 
polypeptides. Preparations of such compounds may have a desired degree of purity 
referring to the relative amounts of the desired polypeptide and for Instance whole cell 
proteins and unwanted variants of the desired polypeptide as defined above. The existence 

20 of a wide range of protein purification and concentration techniques is known to the skilled 
artisan. These techniques include gel electrophoresis. Ion-exchange chromatography, 
affinity and immunoaffinity chromatography, ceramic hydroxyapatite chromatography, 
differential precipitation, molecular sieve chromatography, isoelectric focusing, gel 
filtration, and diafiltration. 

25 

For the various types of chromatography, the desired molecules are suspended in a buffer, 
which promotes adhesion of the molecules to the active surface of the resin and are then 
applied to the chromatography column. Removal of contaminants is performed by washing 
the resin in a buffer of intermediate ionic strength or pH. Elution of the desired molecules 

3D is performed by changing the ionic strength or pH of the buffer to values that will promote 
the dissociation of the molecules from the active surface of the resin used. In the case of 
Immunoaffinity chromatography, the polypeptide may be purified by passage through a 
column containing a resin to which Is bound antibodies which are specific for at least a 
portion of the polypeptide. Furthermore, His- or GST tags may be added to the 

35 polypeptides of the invention. Subsequently, the resulting fusion proteins can be purified 
by affinity chromatography on for instance glutathione sepharose 4B and HIS tag Metal 
Chelate Affinity Chromatography. 
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It IS readily apparent that a person skilled in the art can create nucleic acid molecules of 
virtually any length by llgating a nucleic add molecule encoding any of the amino acid 
sequences of the present invention or any part thereof to an exogenous nucleotide 
sequence. Recombinant nucleic acid molecules generated by this approach are 
5 embodiments of the invention. A recombinant construct can be capable of replicating 
autonomously within a host cell or, alternatively, it can become integrated Into the 
chromosomal DNA. Such a recombinant nucleic acid molecule can comprise a sequence of 
genomic DNA^ cDNA, synthetic or semi-synthetic origin. Again, it is preferred that such 
nucleic acid molecules are encoding one or more B-cell epitopes and/or one or more T-cell 
10 epitopes. 

The nucleic acid embodiments of the present invention can be altered by genetic 
engineering so as to introduce substitutions, deletions and/or additions. In preferred 
embodiments of the invention, these alterations will provide for sequences encoding 
15 functionally equivalent molecules or molecules with the same or improved properties. Such 
changes of the polypeptide embodiments can be generated using techniques that are 
known to a person skilled in the art, including random mutagenesis and site-directed 
mutagenesis. 

20 The use of recombinant polypeptides of the Invention may be preferred when it is required 
that the preparations of these polypeptides are essentially free of any other antigen with 
which they are natively associated, i.e. free of any other antigen from Plasmodium 
parasites. As an alternative this may also be accomplished by synthesizing the polypeptide 
fragments by the well-known methods of solid or liquid phase peptide synthesis. 

25 

Pharmaceutical compositions 

Additional aspects of the present invention relate to pharmaceutical compositions based on 
any of the polypeptide embodiments of the invention. In a presently preferred embodiment 
these pharmaceutical compositions relates to compositions capable of eliciting an immune 
30 response, which may or may not be in form of a vaccine. Preferably, such a composition 
comprises at least one amino acid sequence according to the present invention. 

It is an object of presently preferred embodiments of the present invention to provide such 
an amino acid sequence selected from the group consisting of at least one of 

35 

a) SEQ ID NO.: 2, SEQ ID NO.: 4 and SEQ ID NO.: 6 and 

b) a sequence having at least 80% sequence identity to a); and 

c) sub-sequences of a) or b) with a minimum length of 30 amino acids; and 
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d) sub-sequences of a) or b) comprising at least one B-celi epitope. 

Alternatively, the pharmaceutical composition according to the present Invention may be 
based on any of the nucleotide embodiments of the Invention. In a preferred embodiment, 
5 the pharmaceutical composition comprises at least one nucleotide sequence selected from 
the group consisting of at least one of 

a) SEQ ID NO.: 1, SEQ ID NO.: 3 and SEQ ID NO.: 5 or a sequence 
complementary thereof; and 

b) a nucleotide sequence having at least 80% sequence identity to a); and 

c) sub-sequences of a) or b) with a minimum length of 30 nucleotides; and 

d) sub-sequences of a) and b) which comprise at least one sequence 
encoding a B-cell epitope, 

15 Such compositions may or may not further comprise a vector containing said amino add or 
nucleotide sequence. 

In a specially preferred embodiment, any of the pharmaceutical compositions described in 
the present application may further comprise a pharmaceutically acceptable carrier and/or 
20 an adjuvant. 

Peptides, proteins and carbohydrates antigens are usually pooriy Immunogenic, or not 
immunogenic at all, when administered on their own. Better delivery systems and 
adjuvants may be needed to improve the ease of delivery and Immunogeniclty. In 
25 partiicular. In vivo Immunisation with peptides and proteins to generate MHC class I specific 
response has been difficult. One possible technique to solve this is the use of ISCOMs, 
which are lipid carriers that act as adjuvants but have minimal toxicity, and they appear to 
load peptide and proteins into the cell cytoplasma, allowing MHC class I restricted T-cell 
response to these peptides, which make them useful in human immunisation. 

30 

Pharmaceutical compositions comprising the nucleotide and polypeptide embodiments of 
the invention can be produced by conventional techniques so that the said sequences are 
present as monomeric, multlmeric or muitimerlsed agents. Furthermore, antibodies 
generated from the polypeptide embodiments of the invention may constitute part of such 
35 pharmaceutical compositions. In addition to the active ingredients, pharmaceutical 
compositions may further comprise one or more physiologically acceptable carriers, 
proteins, supports, adjuvants as well as components that may facilitate the delivery of the 
active components of the compositions. As described above, a large number of adjuvants 
are available Including but not limited to Freund^s adjuvant, mineral gels such as 
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aluminium hydroxide, and surface-active substances such as lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, keyhole limpet hemocyanin, and dinitrophenol. As a 
pharmaceutical composition, the nucleic acid and peptide embodiments of the Invention 
will be purified and processed through one or more formulation steps. A large variety of 
5 formulation buffers will be physiologically acceptable, such as phosphate, citrate, and other 
organic acids. 

It is further understood that a pharmaceutical composition must be clinically safe. More 
specifically. It must be free of virus and bacteria that can cause Infection upon 
10 administration of the composition to a subject. It may therefore be necessary to process 
the composition through on or more steps of virus filtration and/or Inactlvatlon, The 
removal of virus by filtration can be obtained by passing the composition through a 
nanofllter, whereas virus Inactlvation can be accomplished by the addition of various 
detergents and/or solvents or other antiviral compounds to the composition. 

15 

Antibodies 

The polypeptide and/or nucleic acid embodiments of the Invention may be used in their 
purified form to generate various types of antibodies, and it is understood that such 
antibodies will also be considered as compounds of the invention. These antibodies may 

20 Include, but are not limited to polyclonal, monoclonal, chimeric, single chain. Fab 

fragments and fragments produced by a Fab expression library. A person sidlled in the art 
Icnows that antibodies can be produced by e.g. immunisation of various hosts including 
goats, rabbits, rats, and mice. The term 'immunisation' refers to the injection of a 
polypeptide with immunogenic properties. Depending on the host species various types of 

25 adjuvants can be used in order to increase the immunological response including but not 
limited to Freund's adjuvant, mineral gels such as aluminium hydroxide, and surface-active 
substances such as lysolecithin, pluronic polyois, potyanions, peptides, oil emulsions, 
keyhole limpet hemocyanin, and dinitrophenol. 

30 It is preferred to use shorter sequences of the polypeptide of the invention fused to a 
powerful immunogenic molecule such as keyhole limpet hemocyanin resulting In the 
production of antibodies against this chimeric molecule. Accordingly, antibodies capable of 
recognising SEQ ID NO. 2, SEQ ID NO.: 4 and/or SEQ ID NO.: 6 can be produced by 
injection of synthetic peptides consisting of as little as 3 to 30 amino. Thus, it is an object 

35 of preferred embodiments of the present invention to provide such small synthetic peptides 
of3-250 amino acids, such as but not limited to 15-150 amino acids, 20-140 amino acids, 
30-130 amino acids, 40-120 amino acids, 50-110 amino acids, 60-100 amino acids, 70-90 
amino acids, 80-85 amino acids corresponding to a particular sequence of the SEQ ID NO. 
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2, SEQ ID NO.: 4 and/or SEQ ID NO.: 6 polypeptides. As an alternative, a more diverse set 
of antibodies can be generated by injection of a purified polypeptide embodiment of the 
invention. 

5 Monoclonal antibodies directed against any of the polypeptides of the present Invention, 
such as a purified polypeptide embodiment of the invention, can be produced using any of 
the conventional techniques that provide for the production of antibodies from cell lines In 
continuous culture. These techniques include the hybridoma technique, the human B-cell 
hybridoma technique, and the EBV-hybridoma technique. 

10 

It will be readily appreciated that polypeptides of the invention can be incorporated into 
vaccines capable of Inducing protective immunity against a specific subtype of malaria. In 
relation to the present invention it is preferred that the vaccine is directed specifically 
against the infectious activity of Plasmodium falciparum in the brain or adhesion to 
15 endothelial cells, which is characteristic of severe malaria. 

One important aspect of the present invention therefore relates to a vaccine comprising 
one or more B-celi epitopes from a polypeptide encoded by any of the nucleotides 
according to the present invention. This vaccine is characterised in that it induces an 
20 antibody response wherein said antibody specifically recognises a molecule expressed on 
the surface of an intact erythrocyte infected by VSAsm type parasites or parasites that have 
been selected for their ability to mediate adhesion to endothelial receptors. Generally, this 
molecule is recognised by the antibodies from semi-Immune children or children suffering 
from severe malaria. 

25 

In preferred embodiments, antibodies directed against the polypeptides and/or nucleic 
acids of the invention can be administered to a subject in order to provide protection 
against the retention and sequestration of iRBC to endothelial cells which is characteristic 
of severe malaria. 

30 

Effective amounts of an agent that will promote an immune response against a compound 
of the present invention can be administered to subjects living in endemic areas so as to 
prevent the contraction of malaria. 

35 In another embodiment, a subject believed to be at risk for contracting malaria may be 
identified either by conventional methods or by one of the In vitro diagnostic techniques, 
which constitute other embodiments of the present Invention. An effective amount of an 
agent that inhibits SEQ ID NO.: 2, SEQ ID NO.: 4, SEQ ID NO.: 6 and/or homologues 
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hereof mediated sequestration or elicits an immune response In a subject can then be 
administered to this subject. 

Another embodiment of tlie present invention relates to an Isolated antibody or isolated 
5 antiserum induced in response to one or more polypeptides according to the invention 
and/or to one or more nucleic acids as defined in the present application. 

Another embodiment of the present invention relates to an isolated antibody or isolated 
antiserum induced in response to one or more polypeptides according to the invention 
10 and/or to one or more nucleic adds as defined In the present application, wherein said 
antibody Is capable of binding to a molecule expressed on the surface of an intact 
erythrocyte infected by a parasite causing malaria. 

Another embodiment of the present invention relates to an isolated antibody or isolated 
15 antiserum induced in response to one or more polypeptides according to the invention 
and/or to one or more nucleic acids as defined in the present application, wherein said 
antibody is capable of recognising paracites selected in vitro for expression of VSAsm- 

A presently preferred embodiment relates to an antibody as described by the present 
20 application, wherein said antibody is capable of binding to a molecule expressed on the 
surface of an intact erythrocyte infected by a parasite capable of mediating cyto-adhesion 
of intact erythrocyte infected by a parasite to human endothelial ceils, but not the CD36 
receptor. 

25 Vaccines 

Additional aspects of the present Invention relate to vaccines based on any of the 
embodiments of the invention, such as but not limited to a vaccine comprising any of the 
polypeptides, the nucleic acids or the recombinant vectors according to the present 
invention, said vaccine characterised in that It induces an immune response, wherein said 
30 immune response specifically recognises a molecule expressed on the surface of an intact 
erythrocyte infected by a parasites. 

In one embodiment such a vaccine relates to a vaccine comprising one or more B-cell 
and/or T-cefl epitopes originating from any of the polypeptides, the nucleic acids or the 
35 recombinant vectors of the present invention, said vaccine characterised in that it induces 
an immune response, wherein said immune response specifically recognises a molecule 
expressed on the surfece of an intact erythrocyte infected by a parasites. 
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One presently preferred embodiment of the present invention relates to a polypeptide- 
based vaccine connprises at least one amino acid sequence selected from the group 
consisting of at least one of 

5 a) SEQ ID NO,: 2, SEQ ID NO.: 4 and/or SEQ ID NO,: 6 and 

b) a sequence having at least 80% sequence identity to a); and 

c) sub-sequences of a) or b) with a mlnjmum length of 30 amino acids; and 

d) sub-sequences of a) or b) comprising at least one B-cell epitope. 

10 Sub-sequences of the polypeptide of the invention used in a vaccine may be any of the 
above mentioned amino acid lengths and In addition to these fragments or sub-sequences 
of the polypeptide of the invention, larger polypeptides comprising sub-sequences of the 
invention as part of their sequence, are also embodiments of the present invention. It is 
preferred, however, that these sub-sequences have a minimum length of 30 amino acids 

15 and that they are at least 80% identical to a region of comparable length within the 
sequence of SEQ ID NO.: 2, SEQ ID NO.: 4 and/or SEQ ID NO.: 6. 

In recent years there has been Increased focus on nucleic acid vaccines. Other aspects of 
the present Invention therefore concern genetic immunisation In which nucleotide based 

20 vaccines such as vaccines based on DNA molecules or on RNA molecules, which result in 
the expression of one or more B-cell and/or T-cell epitopes from a polypeptide encoded by 
a member of the SEQ ID NO.: 1, 3, and/or 5 gene family. As for the polypeptide based 
vaccine this vaccine Is characterised in that it induces an antibody response wherein said 
antibody specifically recognises a molecule expressed on the surface of an Intact 

25 erythrocyte Infected by VSAsm type parasites that have been selected for their ability to 
mediate adhesion to endothelial cells or for increased antibody recognition by plasma from 
young semi-immune children- 
One embodiment of the present Invention relates to a nucleotide based vaccine, such as a 

30 DNA vaccine, which results In the expression of a polypeptide comprising one or more B- 
cell and/or T cell epitopes from any of the polypeptide sequences of the present invention, 
wherein said vaccine Is capable of inducing an immune respons which specifically 
recognises a molecule expressed on the surface of an Intact erythrocyte infected by 
parasites, and wherein said parasites furthermore has a VSAsm phenotype that have been 

35 selected for antibody recognition or adhesion to endothelial receptors in vitro, and wherein 
said molecule is recognised by antibodies from young children living in areas of high 
malaria transmission intensity. 
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One embodiment relates to a DNA vaccine comprising at least one nucleic add sequences 
to the present Invention, wiierein said vaccine is capable of inducing an immune response, 
wherein said immune response specifically recognises a molecule expressed on the surface 
of an intact erythrocyte Infected by parasites. 

5 

Another embodiment relates to a natural, synthetic or recombinant DNA or RNA vaccine 
having any nucleotide sequence according to the present invention, wherein the vaccine is 
capable of eliciting development of ant\- Plasmodium antibodies. 

10 In a preferred embodiment, the present invention relates to a nucleotide-based vaccine, 
which may be a DNA or RNA vaccine, comprising a vector comprising at least one 
nucleotide sequence selected from the group consisting of at least one of 

a) SEQ ID NO.: 1, SEQ ID NO.: 3 and SEQ ID NO.: 5 or a sequence 
15 complementary thereof; and 

b) a nucleotide sequence having at least 80% sequence identity to a); and 

c) sub-sequences of a) or b) with a minimum length of 30 nucleotides; and 

d) sub-sequences of a) and b) which comprise at least one sequence 
encoding a B-cell epitope. 

20 

The vaccine may thus comprise any of the sub-sequences of the nucleotide sequence of 
the invention, which may have any of the sequence Identities described above. It is 
preferred, however, that these sub-sequences have a minimum length of 30 nucleotides 
and that they are at least 80% identical to a region of comparable length within the 
25 sequence of SEQ ID NO.: 1, SEQ ID NO.: 3 and/or SEQ ID NO.: 5. 

According to this aspect of the invention one approach is to Incorporate the DNA encoding 
a polypeptide of the invention or parts hereof into a viral or bacterial vector. The following 
organisms, among numerous others, may be employed for this purpose: Coxsackie virus, 
30 vaccinia virus. Salmonella typhi or Salmonella typhimurium (for oral administration). In 
each case the carrier organism must be acquired by the host cell and the relevant DNA 
sequences used for production of the polypeptide of the invention or parts hereof- These in 
turn are recognised as abnormal by the host or recipient and an immune response ensues. 

35 Alternatively, the parasite nucleic acid sequence may be incorporated into an RNA virus or 
used to prepare viral replicons* This approach allows for the delivery of coding sequences, 
such as mRNA, to the host cell without risking a replicative, infectious process. 
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In order to obtain expression of immunogenic polypeptides it is required tliat elements of a 
nucleotide-based vaccine are capable of entering into the relevant target cells of the 
subject receiving such a vaccine. Therefore, preferred embodiments of the invention 
include vaccines, which further comprise one or more agents and/or vectors to facilitate 
5 such entry. 

In a further preferred embodiment, the vector component of a nucleotide-based vaccine 
comprises a promoter for driving the expression in a mammalian cell line, a nucleotide 
sequence encoding a leader peptide for facilitating secretion/release of a polypeptide 
10 sequence from a mammalian cell, and a terminator. 

The simple concept of a nucleotide-based vaccine is the inoculation of a recipient using the 
relevant DNA sequence alone. This 'naked DNA' approach avoids the administration of 
polypeptide directly, but its effectiveness depends on the ability of the host cell to utilise 
15 the injected DNA as a template for RNA and subsequent protein synthesis. 

It is anticipated that the principal value of providing SM-specifIc protective Immunity to 
sporozoite-induced Infection will be, but not limited to individuals who have not prevlousiy 
been exposed to malaria or have only had a few cases of malaria. In the case of severe 
20 malaria such individuals will be infants and young children in endemic areas, 

travellers/tourists, soldiers or Individuals from a non-malaria endemic area moving to or 
travelling in endemic areas. 

While not being limited by way of theory it is believed that the protection against malaria 
25 obtained by the use of a vaccine is most likely a result of IgGs blocking the interaction 
between the iRBC and endothelial receptors in various organs of the body such as, but not 
limited to the brain. It is a}so possible, however, that opsonized erythrocytes are killed by 
macrophages or T-cells, either by fjagocytosis or by other means. 

30 In a preferred embodiment, the vaccine is therefore capable of inducing an IgG response 
and, accordingly, it comprises a polypeptide comprising one or more B-cell epitopes. It is 
desirable, however, that polypeptides comprising one or more T-cell epitopes are also part 
of the vaccine since assistance from T-cells may be required in order to obtain a good 
antibody response. 

35 

In another preferred embodiment of the invention, the vaccine is therefore based on the 
use of polypeptides of the invention wherein said polypeptides comprises one or more B- 
cell epitopes in combination with one or more T-cell epitopes. In a another preferred 
embodiment of the invention, the vaccine comprises B-cell epitopes in combination with T- 
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celi epitopes originating from an exogenous molecule, and in an further preferred 
embodiment, the peptides of the vaccine comprises only B-cell epitopes. In equally 
preferred embodiments of the invention, the vaccine Is based on nucleotide sequences 
encoding polypeptides^ which have the characteristics with respect to antigen epitopes 
5 described above. 

Techniques exist for enhancing the antigenicity of immunogenic peptides including 
incorporation of these into a multimeric structure, binding to a highly immunogenic protein 
carrier, for example, keyhole limpet hemocyanin, or diptheria toxoid, and administration in 
10 combination with adjuvants or any other enhancers of immune response. Furthermore, it 
will be understood that polypeptides specific for a plurality of Plasmodium stages and 
species may be incorporated in the same vaccine composition to provide a multivalent 
vaccine. In addition, the vaccine composition may comprise antigens to provide immunity 
against other diseases in addition to malaria. 

15 

Immunogenic polypeptides of the invention as weil as nucleic acid molecules encoding such 
polypeptides may be injected as is, or for convenience of administration, it can be added to 
pharmaceutically acceptable carriers or diluents. Suitable pharmaceutically acceptable 
carriers will be apparent to those skilled in the art, and include water and other polar 
20 substances, including lower molecular weight alkanols, polyalkanols such as ethylene 
glycol, polyethylene glycol, and propylene glycol as well as non-polar carriers. 

Routes of administration, antigen dose, number and frequency of injections are all matters 
of optimisation within the scope of ordinary skilJ in the art, particularly in view of the fact 

25 that there is already experience In the art of providing protective Immunity by the mjection 
of Irradiated sporozoites. Protective antibodies are usually best elicited by a series of 2 to 3 
doses given about 2 to 3 weeks apart. The series can be repeated when concentrations of 
circulating antibodies in the vacclnee drops. The polypeptide is present in the vaccine in an 
amount sufficient to induce an immune response against the antigenic polypeptide and 

30 thus to protect against Plasmodium infection thereby protecting the subject against 
malaria. 

Vaccination protocols can include the identification of a subject in need of a vaccine, for 
instance Infants and young children living in regions populated with P. falciparum or non- 
35 immune individuals e.g. tourist/travellers and soldiers travelling through such regions, and 
administration of one or more effective doses of the vaccine to this subject. 

In some aspects, the present Invention can be used to both inhibit the adhesion of iRBC to 
endothelial receptors and to generate an immune response directed at SEQ ID NO. 2, SEQ 
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ID NO.: 4 and/or SEQ ID NO.: 6. It is therefore within the scope of the Invention to 
provide uses of any of the polypeptides of the present invention as medicaments that are 
therapeutically or prophylactically useful or both. 

5 Alternatively, such therapeutic and prophylactic effects can be obtained as a result of the 
expression of polypeptides of the Invention within a diseased subject or a subject at risk 
for contracting malaria. Therefore, It is also within the scope of the invention to provide 
uses of any of the nucleic acid molecules of the present Invention as medicaments that are 
therapeutically or prophylactically useful or both, 

10 

A currently particular preferred embodiment of the present invention relates to a vaccine 
comprising at least one nucleic acid according to the present Invetton or at least one vector 
according to the present invention, the vaccine effecting in vfvo expression of at least one 
antigen by a subject, to whom the vaccine has been administered, the amount of 
15 expressed antigen being effective to confer substantially increased resistance to malaria 
caused by Plasmodium falciparum. 

Embodied in the Invention is also a method for generating a vaccine against malaria 
comprising 

20 

a) Injecting a sequence according to any of claims 1-14 and/or 15-28 In a subject 

b) enabling said subject to generate antibodies specifically recognising any of the 
polypeptide sequences according to claim 1-14 

c) purify said antibodies 

25 d) selecting antibodies having cross-reactivity to parasites causing malaria 

e) selecting antibodies having the ability to Inhibit adhesion to endothelial cells. 

Medical use 
Polypeptides 

30 One presently particular preferred aspect of the present invention relates to an isolated 
polypeptide comprising an amino acid sequence selected from the group consisting of at 
least one of SEQ ID NO.: 2, SEQ IN NO.: 4 and SEQ ID NO.: 6 for use as a medicament. 

In a presently preferred embodiment the present Invention relates to use of any of the 
35 amino acid sequences according to the present Invention capable of mediating cyto- 

adhesion of intact erythrocyte infected by a parasite to human endothelial cells, but not to 
the CD36 receptor. 

Such amino acid sequences can be segregated by an assay known to the skilled person 
40 comprising Chinese hamster ovary (CHO) cells transfected to express human CD36 and 
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cultured by standard methodology to measure iRBC adhesion to these receptors. In brief, 

parasites can be radiolabelled by incubating the cultures overnight In the presence of H- 
phenylalanln (1 MBq for a standard culture containing 200 pi packed RBC). Wildtype and 
CD36-transfected CHO cells can be grown to a monolayer in 96-well microtltre plates 
5 (Nunc, Roskilde, Denmark). Late-stage-enrlched iRBC (100 pi, 1x10^ RBC/ml) can be 
added to the CHO cell monolayer and incubated for one hour at 37*C before unbound IRBC 
can be washed away from the CHO cell monolayer. Finally, the number of CHO-adhering 
iRBC could be determined by liquid scintillation spectrometry. 

10 In another presently preferred embodiment the present invention relates to use of any of 
the amino acid sequences according to the present invention, wherein said amino acid is 
consistently up-regulated after antibody selection-induced change from VSAum to VSAsm 
expression or following selection for adhesion to bone marrow derived endothelial cells. 
Said amino acids do not belong to PfEMPl groups B, C, B/C, varl or var2 as defined by 

15 Lavstsen et aL 2003, and are further characterised lacking 1-2 cysteine residues in DBLa 
homology group G (Smith et al. 2000) compared to most PfEMPl molecules in groups B 
and C. 

For the present aspects of the invention it applies, as describe by the inventors, that it is 
20 an object of preferred embodiments of the present invention to provide any polypeptides, 
which are subject to antibody recognition by antibodies in sera from young children living 
in areas of high malaria transmission intensity and/or capable of mediating cyto-adhesion 
of intact erythrocyte infected by a parasite to endothelial cells, but not the CD36 receptor, 
for use as a medicament. Thus in one embodiment, the amino acid sequence may have at 
25 least 80% sequence identity to SEQ ID NO.: 2, SEQ ID NO.: 4 and SEQ ID NO.: 6. 

It should be understood by the skilled addressee that any feature and/or aspect 
discussed above in connection with the sequence identity of the polypeptides 
according to the invention apply by analogy to antibodies according to the invention. 

30 

In another embodiment, the present invention relates to medical uses of an isolated 
polypeptide, wherern the amino acid sequence is a sub-sequence of with a minimum length 
of 10 amino acids. 

35 Again, it should be understood that any lenght of an isolated polypetide described above 
by analogy applies to the present aspect of the invention. 

A presently preferred embodiment relates to medical use of an isolated polypeptide 
comprising the amino acid sequence shown In SEQ ID NO:2. 

40 

Another presently preferred embodiment relates to medical use of an isolated polypeptide 
comprising the amino acid sequence shown in SEQ ID NO:4. 
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Another presently preferred embodinnent relates to medical use of an isolated polypeptide 
comprising the amino acid sequence shown in SEQ ID NO:6. 

A particular presently preferred embodiment relates to medical use of an isolated 
5 polypeptide consisting of the amino acid sequence shown in SEQ ID NO: 2. 

Another particular presently preferred embodiment relates to medical use of an isolated 
polypeptide consisting of the amino acid sequence shown in SEQ ID NO:4. 

10 Another particular presently preferred embodiment relates to medical use of an isolated 
polypeptide consisting of the amino acid sequence shown in SEQ ID NO: 6, 

One embodiment of the present invention relates to medial use of an isolated polypeptide 
according to the present invention, wherein the amino acid sequence has at least 80% 
15 sequence identity to SEQ ID NO:2. 

Another embodiment of the present invention relates to medial use of an isolated 
polypeptide according to the present Invention, wherein the amino acid sequence has at 
least 80% sequence identity to SEQ ID NO:4. 

20 

Yet another embodrment of the present invention relates to medial use of an isolated 
polypeptide according to the present invention wherein the amino acid sequence has at 
least 80% sequence identity to SEQ ID NO: 6. 

25 One specially preferred embodiment of the present invention relates to medical use of any 
of the polypeptides described herein, wherein the amino acid sequence is capable of 
mediating cyto-adhesion of intact erythrocyte infected by a parasite to human endothelial 
cells, but not to the CD36 receptor. 

30 Another specially preferred embodiment of the present invention relates to medical use of 
any of the polypeptides described herein wherein the amino acid is consistently up- 
regulated after antibody selection-induced change from VSAum to VSAsm expression. 

Nucleic acids 

35 The embodiments of the present invention thus relate to polypeptides of the PfEMPl class 
or sub-sequences hereof as well as nucleic acid molecules encoding such polypeptides or 
sub-sequences, thus one aspect of the present invention relates to medical uses of an 
isolated nucleic acid comprising a nucleotide sequence selected from the group consisting 
of at least one of SEQ ID NO.: 1, SEQ ID NO.: 3 and SEQ ID NO.: 5 for use as a 

40 medicament. 

As described by the present inventors the aim of the present invention is to provide a 
nucleic acid sequence encoding a polypeptide which is capable of mediating cyto-adhesion 
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of intact erythrocyte infected by a parasite to human endothelial cells and/or provide a 
nucleic acid sequence which is consistently upregulated after antibody selection-induced 
change from VSAum to VSAsm expression. 

5 This one embodiment relates to medical use of a nucleic acid, wherein the nucleotide 

sequence has at least 80% sequence identity to SEQ ID NO,: 1, SEQ ID NO.: 3 and/or SEQ 
ID NO.: 5. 

Another presently preferred embodiment relates to medical use of a nucleic acid wherein 
10 the nucleotide sequence is a sub-sequence of with a minimum length of 30 nuclotides. 

All the features described herein relating to the nucleic acid molecules of the present 
invention are also applicable as embodiments relating to the medical use of said nucleic 
acids, and vice versa. 

15 

Another presently preferred embodiment relates to medical use of a nucleic acid, wherein 
the nucleic acid comprises the nucleotide sequence shown in SEQ ID NO:i. 

Another presently preferred embodiment relates to medical use of a nucleic add, wherein 
20 the nucleic acid consists of the nucleotide sequence shown in SEQ ID NO:l. 

Another presently preferred embodiment relates to medical use of a nucleic acid, wherein 
the nucleic acid comprises the nucleotide sequence shown in SEQ ID NO:3. 

25 Another presently preferred embodiment relates to medical use of a nucleic acid, wherein 
the nucleic acid consists of the nucleotide sequence shown in SEQ ID NO:3. 

Another presently preferred embodiment relates to medical use of a nucleic acid, wherein 
the nucleic acid comprises the nucleotide sequence shown in SEQ ID NO: 5. 

30 

Another presently preferred embodiment relates to medical use of a nucleic acid, wherein 
the nucleic acid consists of the nucleotide sequence shown in SEQ ID NO:5. 

Another presently preferred embodiment relates to medical use of a nucleic acid, wherein 
35 the nucleotide sequence has at least 80% sequence identity to SEQ ID NO:l. 

Another presently preferred embodiment relates to medical use of a nucleic acid, wherein 
the nucleotide sequence has at least 80% sequence identity to SEQ ID NO:3. 

40 Another presently preferred embodiment relates to medical use of a nucleic add, wherein 
the nucleotide sequence has at least 80% sequence identity to SEQ ID NO: 5. 
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Another presently preferred embodiment relates to medical use of a nucleic acid, wherein 
the sequence Is consistently upregulated after antibody selection-induced change from 
VSAuM to VSAsM expression. 

5 Another presently preferred embodiment relates to medical use of a nucleic acid, wherein 
the nucleic acid sequence encodes a polypeptide which Is capable of mediating cyto- 
adhesion of intact erythrocyte infected by a parasite to human endothelial ceils, but not 
the CD36 receptor. 

10 Another presently preferred embodiment relates to medical use of a nucleic acid, wherein 
the nucleic acid sequence encodes a polypeptide which is recognised by plasma from 
young semi-immune children. 

In a further preferred embodiment nucleic acid sequence is a re-codonised sequence. 
15 Particularly preferred are sequences that are recodonised in order to enhance or optimise 
expression of the resulting protein or polypeptide in a given expression system. 

Vector 

Another aspect of the present Invention relates to a recombinant vector comprising any of 
20 the nucleic acids defined in the present application operably linked to one or more control 
sequences for use as a medicament. 

Further medfcaf uses 

In another aspect, the present invention relates to use of any of the polypeptides 
25 according to the present invention for the manufacture of a composition to be administered 
in order to prophylactically or therapeutically reduce the incidence, prevalence or severity 
of malaria in a subject. 

Thus, in one embodiment, the present invention relates to use of any of the polypeptides 
30 according to the present invention for the manufacture of a vaccine for prophylac of 
malaria. 

In another embodiment the present invention relates to use of any of the polypeptides 
according to the present invention for the manufacture of a composition for vaccination 
35 against malaria. 

As will be apparent, the present invention further relates to use of any nucleic acid 
according to the present invention for the manufacture of an composition to be 
administered in order to prophylactically or therapeutically reduce the Incidence, 
40 prevalence or severity of malaria in a subject. 
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In one embodiment, the present invention relates to use of any nucleic acid according to 
the present invention for the manufacture of a vaccine for malaria prophylaxis . 

In another embodiment the present invention relates to use of any nucleic acid according 
5 to the present invention for the manufacture of a composition for vaccination against 
malaria. 

As the skilled adressee would recognise the present Invention also relates to use of a 
recombinant vector according to the present invention for the manufacture of a 
10 composition to be administered in order to prophylactlcaily or therapeutically reduce the 
incidence, prevalence or severity of malaria in a subject. 

In one embodiment the present invention relates to use of a recombinant vector according 
to the present invention for the manufacture of a vaccine for prophylactic treatment of 
15 malaria. 

In another embodiment the present invention relates to use of a recombinant vector 
according to the present invention for the manufacture of a composition for vaccination 
against malaria. 

20 

In a presently preferred embodiment of the present invention said malaria is caused by 
Plasmodium falciparum. 

Treatment 

25 

It should be understood that any feature and/or aspect discussed above in connection 
with the uses according to the invention apply by analogy to methods of treatment 
according to the invention. 

30 Thus, is one aspect the present invention relates to a method for prophylactlcally or 
therapeutically reduce the incidence, prevalence or severity of malaria in an subject said 
method comprising administering to said subject an effective amount of a polypeptide, a 
nucleic acid or a recombinant vector according to the present invention. 

35 In one embodiment the present invention relates to a method for the prophylactic 

treatment of malaria in a subject, said method comprises administering to said subject an 
effective amount of a polypeptide, a nucleic or a recombinant vector according to the 
present invention. 
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In another embodiment the present invention relates to a vaccination method against 
malaria in a subject, said vaccination method comprising administering to said subject an 
effective amount of a polypeptide, a nucleic add or a recombinant vector according to the 
present invention. 

5 

Composition 

Another aspect of the present invention relates to use of the polypeptides and/or the 
nucleic acids as defined by the present application for the preparations of a composition. 
Thus in one embodiment the invention relates to a composition comprising a polypeptide 
10 according to the invention or a nucleic acid according to the invention and a 
pharmaceutically acceptable diluent, carrier or adjuvant. 

In a presently preferred embodiment the composition Is an immunogenic composition. 

15 In a presently most preferred embodiment the composition induces an IgG/IgM antibody 
response. 

Production of pharmaceuticals 

Another aspect of the present invention may be the production of pharmaceuticals based 
20 on polypeptides of the invention or sub-sequences hereof or nucleic acid sequences 
encoding such molecules, as described above. Such pharmaceuticals may also include 
agents such as but not limited to other polypeptides and in particular antibodies, which are 
capable of modulating and/or inhibit the adhesion of SEQ ID NO.: 2, SEQ ID NO.:4, SEQ ID 
NO.:6 and/or homologous hereof to endothelial receptors. 

25 

Accordingly, it is within the scope of the invention to provide the use of any of the amino 
acid sequences according to the present invention for the manufacture of a composition, 
such as an immunogenic composition, which is to be administered in order to 
prophylactically or therapeutically reduce the incidence, prevalence or severity of severe 
30 malaria (SM) In an indlviduaJ non-Immune to SM causing parasites. 

In addition, the Invention also relates to the use of a nucleic acid molecule according to the 
present invention for the manufacture of a composition, such as an immunogenic 
composition, which is to be administered In order to prophylactically or therapeutically 
35 reduce the Incidence, prevalence or severity of severe malaria in an Individual non--immune 
to SM causing parasites. 
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Delivery of these pharmaceuticals can be performed by any conventional route Including, 
but not limited to, transdermal, parenteral, gastrointestinal, transbronchial, and 
transalveolar administration, 

Biotechnological tools 

5 The use of the nucleic acid and polypeptide-based embodiments of the present invention 
can also extend to their use as biotechnological tools and as components of diagnostic 
assays. 

Thus^ one embodiment relates to an in vitro diagnostic method, said method comprising 
10 contacting a sample with any of the polypeptides according to the present invention under 
conditions allowing an m vitro immunological reaction to occur between said polypeptide 
and the antibodies possibly present in said sample, and in vitro detect the antigen- 
antibody complexes possibly formed. 

15 In a presently preferred embodiment said diagnostic assay, further relates to an in vitro 
diagnostic method, wherein a disease-state profile for a tested subject is generated by 
determining the concentration or expression level in a sample of any of the polypeptide 
and/or nucleic acid sequences as defined in the present application. 

20 Additional embodiments of the Invention therefore mclude an in vitro diagnostic method, 
which comprises contacting a sample such as a tissue or biological fluid with a polypeptide 
comprising a sequence selected from the group consisting of at least one of 

a) SEQ ID NO.: 2, SEQ ID NO.: 4, and SEQ ID NO.: 6; and 
25 b) a sequence having at least 80% sequence identity to a); and 

c) sub-sequences of a) or b) with a minimum length of 30 amino acids; and 

d) sub-sequences of a) or b) comprising at least one B-cell epitope 

under conditions allowing an in vitro immunological reaction to occur between said 
30 polypeptide composition and the antibodies possibly present in the biological sample, and 
the in vitro detection of the antigen-antibody complexes possibly formed. In one preferred 
embodiment the polypeptide is immobilised on a solid support. 

Other embodiments include an in vitro diagnostic method, which comprises contacting a 
35 sample such as a tissue or biological fluid with a nucleotide composition comprising a 
sequence selected from the group consisting of at least one of 

a) SEQ ID NO.: 1, SEQ ID NO.: 3, and SEQ ID NO.: 5 or a sequence 
complementary thereof; and 
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b) a nucleotide sequence having at least 80% sequence identity to a); and 

c) sub-sequences of a) or b) with a nninimunr^ length of 18 nucleotides 

5 

under conditions allowing an in vitro reaction to occur between said nucleotide composition 
and e.g. primers and probes. 

In some aspects, the nucleic acid embodiments are employed as nucleic acid probes in 
10 hybridisation assays, in cloning, or as primers for polymerase chain reaction (PGR). 
Similarly, the pofypeptide-based embodiments can be used as components of 
immunological reactions such as ELISA, radio-immunoassays (RIA) and adhesion-blocking 
assays- The scope of such work can be, for example, to characterise SEQ ID NO.: 2, SEQ 
ID NO.: 4, and/or SEQ ID NO.: 6 or regions of SEQ ID NO.: 2, SEQ ID NO.: 4, and/or SEQ 
15 ID NO.: 6 involved in interaction with endothelial receptors or antibodies 

In some diagnostic embodiments, nucleic acids complementary to the nucleic acid 
molecules of the invention or fragments hereof are used to identify SEQ ID NO.: 1, SEQ ID 
NO-: 3, and/or SEQ ID NO.: 5 nucleic acids (e.g. mRNA) present in a biological sample, for 

20 instance a tissue sample or a sample of body fluid such as blood or serum. In a preferred 
diagnostic embodiment, nucleic acid molecules complementary to fragments of SEQ ID 
NO.: 1, SEQ ID NO.: 3, and/or SEQ ID NO.: 5 comprising sequences, which are not found 
in nucleic acids encoding other SEQ ID NO.: 2, SEQ ID NO.: 4, and/ SEQ ID NO.: 6 
proteins, are used to identify SEQ ID NO.: 1, SEQ ID NO.: 3, and/or SEQ ID NO.: 5 nucleic 

25 acids (e.g. mRNA) present in a biological sample. 

In other diagnostic embodiments, nucleic acids identical to the nucleic acid molecules of 
the invention or fragments hereof are used to Identify SEQ ID NO.: 1, SEQ ID NO.: 3, 
and/or SEQ ID NO.: 5 nucleic acids (e.g. mRNA) present In a biological sample, for 

30 Instance a tissue sample or a sample of body fluid such as blood or serum. In a preferred 
diagnostic embodiment, nucleic acid molecules identical to fragments of SEQ ID NO.: 1, 
SEQ ID NO.: 3, and/or SEQ ID NO.: 5 comprising sequences, which are not found in 
nucleic acids encoding other SEQ ID NO.: 2, SEQ ID NO.: 4, and/ SEQ ID NO.: 6 proteins, 
are used to identify SEQ ID NO.: 1, SEQ ID NO.: 3, and/or SEQ ID NO.: 5 nucleic acids 

35 {e.g. mRNA) present in a biological sample. 

The concentration or transcription level in the infected subject of SEQ ID NO.: 1, 3, and/or 
5 nucleic acids or other nucleic acids, which encode proteins that can mediate adhesion to 
endothelial cells will differ depending on the type of Piasmodium infection. Thus, some 
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Plasmodium parasites will only cause the expression of low amounts of SEQ ID NO.: 2, 
SEQ ID NO,: 4, and/ or SEQ ID NO,: 6 or no expression at all. Likewise it will not be 
possible to detect any expression of SEQ ID NO.: 2, SEQ ID NO.: 4, and/or SEQ ID NO.: 6 
In subjects that are not carrying a Plasmodium infection. Accordingly, malaria and, more 
5 specifically, severe malaria can be diagnosed by determining the concentration of SEQ ID 
NO.: 1, 3, and/or 5 gene transcripts fn an individual at risk of contracting this disease. In 
the case of severe malaria such individuals may be e.g. Infants or young children who live 
In endemic areas, and previously unexposed individuals travelling into endemic areas. 

10 One embodiment of the present invention is therefore an In vitro diagnostic method 

whereby infection with Plasmodium and more specifically infection with P. falciparum can 
be detected. In a preferred embodiment, a disease state profile can be created by 
collecting data on the transcription level of SEQ ID NO.: 1, 3, and/or 5 in a large number 
of Infected subjects and subsequent using these sets of data as reference. The 

15 concentration or transcription level of SEQ ID NO.: 1, 3, and/or 5 detected in a tested 
subject can then be compared to this reference material so as to predict or follow the 
disease-state of that particular individual. Thus, In some embodiments the term " SEQ ID 
NO.: 1, 3, and/or 5 disease-state profile" refers to the concentration or transcription level 
or concentration range or transcription level range of a nucleic acid sequence encoding 

20 SEQ ID NO.: 2, 4, and/or 6 or a part hereof that is detected in a biological sample. Arrays 
comprising nucleotide probes comprised by the nucleotide sequence of the invention or 
fragments hereof or real-time quantitative PGR can be used to create such disease-state 
profiles. 

25 In a similar fashion to that discussed above, a SEQ ID NO.: 2, 4, and/or 6 disease-state 
profile comprising concentration levels or concentration range levels of SEQ ID NO.: 2, 
SEQ ID NO.: 4, and/or SEQ ID NO.: 6 amino acid sequences in healthy and diseased 
subjects can be.created and used to follow tine dlsease-state of an individual. Accordingly, 
in some embodiments the term " SEQ ID NO.: 2, SEQ ID NO.: 4, and/or SEQ ID NO-: 6 

30 disease-state profile" refers to the concentration or concentration range or the expression 
level or expression level range of a polypeptide corresponding to SEQ ID NO.: 2, SEQ ID 
NO.: 4, and/or ^SEQ ID NO.: 6 or a part hereof in a biological sample. Preferred methods 
for detecting such proteins or polypeptides include radioactive or non-radioactive Immune 
based approaches such as ELISA or radlo-immunoassays as well as standard membrane- 

35 blotting techniques. 

TTie invention also relates to a method for the In vitro detection of antibodies, which 
correlate with malaria originating from the infection of an individual P. falciparum in a 
tissue or biological fluid likely to contain such antibodies. This procedure comprises 
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contacting a biological fluid or tissue sample as defined above with a preparation of 
antigens comprising the polypeptide of the invention or any part hereof under conditions, 
which allow an in vitro immunological reaction to occur between these antigens and the 
antibodies possibly present in the tissue or fluid. It further comprises the in vitro detection 
5 of the antigen-antibody complexes possibly formed by the use of conventional techniques. 
As an example, a preferred method involves the use of techniques such as ELISA, as well 
as immuno-ffuorescent or radio-immunological assays (RIA) or equivalent procedures. 
Again, such techniques can be used for collecting data on the concentration of antibodies 
against the polypeptide of the invention or parts hereof in subjects infected with 

10 Piasmodlum parasites. These data can serve as reference when compared to the 

concentration of antibodies against the polypeptide of the invention detected in a given 
subject and a disease-state profile can be generated on the basis hereof. Thus, in some 
embodiments the term " SEQ ID NO.: 2, SEQ ID NO.: 4, and/or SEQ ID NO.: 6 disease- 
state profile" refers to the concentration or concentration range of SEQ ID NO.: 2, SEQ ID 

15 NO.: 4, and/or SEQ ID NO.: 6 antibodies, which are detected in a biological sample. 

Further, some aspects of the Invention relate to the process of identifying compounds or 
compositions, which can be employed in the therapeutic treatment or prophylacxis of 
malaria. This may for instance be a method for identifying agents capable of modifying the 
20 SEQ ID NO.: 2, SEQ ID NO.: 4, and/or SEQ ID NO.: 6 dependent adhesion to human 

endothelial cells. When contacted with the agent(s) of interest, the adhesion of the iRBC to 
human endothelial cells is avoided. Alternatively, interaction of the agent(s) with the 
expressed polypeptides is avoided. 

Kits 

25 

In other aspects of the invention, kits are provided which will simplify the use of the 
polypeptide and nucleotide embodiments of the Invention for in vitro diagnostic purposes. 

Such an in vitro diagnostic kit may comprise 

30 

a) any polypeptide and/or nucleic acid sequence as defined in the present 
application, 

b) reagents for preparing a suitable medium for carrying out an 

35 immunological reaction between an antibody present in a sample of body 

fluid or tissue and said sequence; and 
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c) reagents allowing the detection of the antigen-antibody complexes formed, 
wherein said reagents may bear a radioactive or non-radioactive label. 

In addition to this component, the kit may comprise reagents for preparing a suitable 
5 medium for carrying out an immunological reaction between an antibody present in a 
sample of body fluid and said sequence; and reagents allowing the detection of the 
antigen-antibody complexes formed, wherein said reagents may bear a radioactive or non- 
radioactive label. 

10 Such an in vitro diagnostic kit may also comprise a cDNA chip with probes that have 
nucleotide sequences complementary to cDNA of the PfEMPl described in the present 
application; primers for amplifying DNA obtained from clinical samples by PGR; and, means 
for labeling amplified DNA hybridized with the probes of the said cDNA chip. The cDNA chip 
may further comprise position markers to locate probes, and staining or labeling is 

15 performed by using, means for labeling comprising preferably biotin-binding material, most 
preferably, streptav!din-R-phycoerythrin which is a conjugate of a fluorophore and a 
protein with biotin-binding sites, The process for preparing cDNA chip contained in the kit 
comprises the steps of: preparing 5' terminal amine-linked DNA probes which have 
nucleotide sequences complementary to cDNA of the PfEI^Pl described in the present 

20 application; affixing the DNA probes thus prepared to an aldehyde-derivatized solid 
surface; and, reducing excessive aldehydes not reacted with amine. 

According to the present invention, Piasmodium faidparum in a cell or tissue can be 
detected by quantitatively measuring the level of one or more nucleic acids of the present 

25 invention using, e.g., real time polymerase chain reaction (PGR) with at least one 

oligonucleotide primer pair or oligonucleotide specific probe being capable of distinguishing 
between the PfEMPl described by the present application and other polypeptides. In 
general, a quantitatively measured level in a biological sample from a subject, e.g., human 
of any of the nucleic acids described in the present application that is higher than the 

30 quantitatively measured level in a biological sample from a normal subject is indicative of 
malaria in the subject. 

The kit of the invention is an implement that can detect malaria infection in a simple and 
accurate manner, as well as identify the types of infecting paracites, therefore, it may 
35 contribute to eariy diagnosis, prevention and treatment of malaria. 

Alternatively, the in vitro diagnostic kit may comprise antibodies which specifically 
recognise a sequence selected from the group consisting of at least one of a) SEQ ID NO.: 
2, SEQ ID NO.: 4, and SEQ ID NO.: 6; and b) a sequence having at least 80% sequence 
40 identity to a); and c) sub-sequences of a) or b) with a minimum length of 30 amino adds; 
and d) sub-sequences of a) or b) comprising at least one B-cell epitope as well as reagents 
for preparing a suitable medium for carrying out an immunological reaction between said 
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antibody and a sequence possibly present in a sample of body fluid or tissue and reagents 
allowing the detection of the antigen-antibody compiexes formed. Said agents or said 
antibodies may optionally bear a radioactive or non-radioactive label. 

5 A variety of assays can be utilized in order to detect antibodies that specifically bind to the 
desired polypeptide. Exemplary assays are described in detaii in Antibodies: A Laboratory 
Manual, Harlow and Lane (eds.)i Cold Spring Harbor Laboratory Press, 1988. 

Representative examples of such assays include: countercurrent immuno-electrophoresis 
10 (CIEP), radioimmunoassays, radioimmunoprecipitations^ enzyme-linked immuno-sorbent 
assays (ELISA), dot blot assays. Inhibition or competition assays, and sandwich assays, 
immunostick (dipstick) assays, simultaneous immunoassays, immunochromatographfc 
assays, immunofiltration assays, latex bead agglutination assays, immunofluorescent 
assays, biosensor assays, and low-light detection assays. 

15 

Still another aspect of the present invention provides antibodies, as discussed above, for 
detecting Plasmodium falciparum in diagnostic tests. Such antibodies are useful in a wide 
variety of antibody-based assays. As discussed above, exemplary assays are described in 
detail in Antibodies: A Laboratory Manual, (supra); U.S. Pat. No. 4,736,110 and U.S. Pat. 
20 No. 4,486,530. 

These antibody- based diagnostic tests include but are not limited to the following tests: 

A fluorescent antibody test (FA-test) uses a fluorescently-labeled antibody able to bind to 
25 one of the polypeptides of the invention. For detection, visual determinations are made by 
a technician using fluorescence microscopy, yielding a qualitative result. In a preferred 
embodiment, this assay is used for the examination of samples such as but not limited to 
blood samples. 

30 FACS assay using fluorescently-iabeled antibodies to bind one of the polypetides of the 
present invention expressed on the surface of R8C, yieldrng a quantitative result. In a 
preferred embodiment, this assay Is used for the examination of samples such as but not 
limited to blood samples. 

35 In latex bead agglutination assays, antibodies to one or more of the proteins of the present 
invention are conjugated to latex beads. The antibodies conjugated to the latex beads are 
then contacted with a sample under conditions permitting antibodies to bind to desired 
proteins in the sample, if any. The results are then read visually, yielding a qualitative 
result. In a preferred embodiment, this format can be used in the field for on-site testing. 

40 

Enzyme immunoassays (EIA) include a number of different assays able to utilize the 
antibodies provided by the present invention. For example, a heterogeneous indirect EIA 
uses a solid phase coupled with an antibody of the invention and an affinity purified, anti- 
IgG immunoglobulin preparation. Preferably, the solid phase is a polystyrene microtlter 
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plate. The antibodies and immunoglobulin preparation are then contacted with the sample 
under conditions permitting antibody binding, which conditions are well known In the art. 
The results of such an assay can be read visually, but are preferably read using a 
spectrophotometer, such as an ELISA plate reader, to yield a quantitative result. 

5 

An alternative solid phase EIA format includes a plastic-coated ferrous metal beads able to 
be moved during the procedures of the assay by means of a magnet. Yet another 
alternative is a low-light detection immunoassay format. In this highly sensitive format, 
the fight emission produced by appropriately labeled bound antibodies are quantitated 
10 automatically. Preferably, the reaction is performed using microtiter plates. 

In a capture-antibody sandwich enzyme assay, the desired protein is bound between an 
antibody attached to a solid phase, preferably a polystyrene microtiter plate, and a labeled 
antibody. Preferably, the results are measured using a spectrophotometer, such as an 
15 ELISA plate reader. 

In an alternative embodiment, a radioactive tracer Is substituted for the enzyme mediated 
detection in an EIA to produce a radioimmunoassay (RIA). 

20 In a sequential assay format, reagents are allowed to incubate with the capture antibody in 
a stepwise fashion. The test sample Is first incubated with the capture antibody. Following 
a wash step, incubation with the labeled antibody occurs. In a simultaneous assay, the two 
incubation periods described in the sequential assay are combined- This eliminates one 
incubation period plus a wash step. 

25 

A dipstick/immunostick format is essentially an Immunoassay except that the solid phase, 
instead of being a polystyrene microtiter plate, is a polystyrene paddle or dipstick. 
Reagents are the same and the format can either be simultaneous or sequential. 

30 In a chromatographic strip test format, a capture antibody and a labeled antibody are 
dried onto a chromatographic strip, which is typically nitrocellulose or nylon of high 
porosity bonded to cellulose acetate. The capture antibody Is usually spray dried as a line 
at one end of the strip. At this end there is an absorbent material that is in contact with 
the strip. At the other end of the strip the labeled antibody is deposited in a manner that 

35 prevents it from being absorbed into the membrane. Usually, the label attached to the 
antibody is a latex bead or colloidal gold. The assay may be initiated by applying the 
sample immediately In front of the labeled antibody. 

Immunoflltration/immunoconcentration formats combine a large solid phase surface with 
40 directional flow of sample/reagents, which concentrates and accelerates the binding of 
antigen to antibody. In a preferred format, the test sample is preincubated with a labeled 
antibody then applied to a solid phase such as fiber filters or nitrocellulose membranes or 
the like. The solid phase can also be precoated with latex or glass beads coated with 
capture antibody. Detection of anaiyte is the same as standard immunoassay. The flow of 
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sample/reagents can be modulated by either vacuum or the wicking action of an 
underlying absorbent material. 

A threshold biosensor assay is a sensitive, instrumented assay amenable to screening 
5 large number of samples at low cost. In one embodiment, such an assay comprises the 
use of light addressable potentiometric sensors wherein the reaction involves the detection 
of a pH change due to binding of the desired protein by capture antibodies, bridging 
antibodies and urease-conjugated antibodies. Upon binding, a pH change is effected that Is 
measurable by translation into electrical potential (pvolts). The assay typicaliy occurs in a 
10 very small reaction volume, and is very sensitive. Moreover, the reported detection limit of 
the assay is 1,000 molecules of urease per minute. 

For diagnostic methods, which are based on detecting the presence of polypeptides of the 
invention sub-sequences with a low degree of sequence identity to polypeptides, which are 
15 unrelated to SEQ ID NO.: 2, SEQ ID NO.: 4, and/or SEQ ID NO,: 6 are preferred. 

In a preferred embodiment, the kit comprises a solid support to which the antibodies of the 
kit are coupled. Such a support may for instance comprise an organic polymer. 

20 In an additional embodiment, the kit comprises one or more doses of a vaccine in addition 
to the diagnostic components as described above. It is contemplated that such a kit may 
simplify the process of identifying and treating subjects in need of one of the therapeutic 
or prophylactic embodiments of the invention. Furthermore, the diagnostic components of 
a kit may be used to determine the presence of antibodies and thereby the efficiency of 

25 the vaccine In each Individual subject. 

In certain embodiments a kit comprises preparations of the polypeptide and/or nucleotide 
embodiments of the invention filled in a number of separate containers. The containers can 
be entirely separate or can be constituted by separate chambers of the same applicator 

30 device. Where the containers are separate, they could be provided in the form of a kit 
comprising separate dispensers or syringes. Where the containers form part of the same 
applicator, they could for example, be defined by separate barrels of a multi-barrel 
syringe. A kit may thus comprise containers and/or barrels, where one container or barrel 
contains an immunogenic substance and another container or barrel contains a diluent 

35 and/or a carrier and/or an adjuvant. Other containers or barrels may contain diagnostic 
components. 

Novel agents 

Within the scope of the present invention are also methods for identifying and/or designing 
40 novel agents useful in the prevention or treatment of malaria. Embodied in the Invention Is 
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therefore a method for testing an mhibitor-molecule capable of inhibiting binding of any of 
the polypeptides according to the present invention to a receptor expressed on endotheliai 
cells comprising 

a) in Vitro cultures of endothelial cells 
5 b) add potential inhlbfting-molecule 

c) add RBC Infected with parasites, said iRBC expressing any of said 
polypeptide sequences on their surface of the RBC 

d) measure the binding of the IRCB with said endothelial celts by 
microscopy or other means of quantifying binding as for instance liquid 

10 scintillation spectrometry. 

Embodied in the invention is also a method for identifying an agent, which is capable of 
disrupting the Plasmodium life cycle, and an agent, which specifically modulates SEQ ID 
NO.: 2, SEQ ID NO.: 4, and/or SEQ ID NO.: 6 dependent adhesion to endothelial 
15 receptors, the method comprising providing a cell expressing an amino acid sequence 
selected from the group consisting of at least one of 

a) SEQ ID NO.: 2, SEQ ID NO.: 4, and SEQ ID NO.: 6; and 

b) a sequence having at least 80% sequence identity to a); and 

20 c) sub-sequences of a) or b) with a minimum length of 30 amino adds; and 

d) sub-sequences of a) or b) comprising at least one B-cell epitope 

and contacting said cell with the agent and detecting adhesion of said cell to endothelial 
receptors. 

25 

By this approach, an agent, which inhibits adhesion of a polypeptide of the invention to 
endothelial receptors, can be identified by contacting endothelial receptors with 
polypeptides of the invention or sub-sequences thereof in the presence of the agent- 
Detection is accomplished and successful agents Identified - according to their ability to 
30 induce a desired modulation of the formation of complexes of endothelial receptors and 
polypeptides of the invention. 

In a preferred embodiment, this method is based on the detection of cells, which adhere to 
endothelial cells on a solid support. Again, such a support may for instance comprise a 
35 resin, a membrane, an organic polymer, a lipid or a cel> or part thereof. According to 
another aspect of the invention a support comprising a polypeptide of the invention or a 
fragment thereof coupled to it can be used to capture endothelial receptors and thereby 
identify substances that are capable of modulating the interaction of endothelial receptors 
and a polypeptide of the invention. The method may be based on directly or indirectly 



36017DK01 



45 

labelled endothelial receptors or a labelled polypeptide of the invention as well as the 
labelling of whole cells using radioactive as well as non-radioactive techniques. 

Another possibility of using the polypeptide embodiments of the present Invention is the 
5 development of a method for identifying an agent, which Interacts with an amino acid 
sequence selected from the group consisting of at least one of 

a) SEQ ID NO.: 2, SEQ ID NO.: 4, and SEQ ID NO.: 6; and 

b) a sequence having at least 80% sequence identity to a); and 

10 c) sub-sequences of a) or b) with a minimum length of 30 amino acids; and 

d) sub-sequences of a) or b) comprising at least one B-cell epitope; 

said method comprising providing a cell expressing one or more of said polypeptides; 
contacting said cell with the agent; and detecting the interaction of the agent with one or 
15 more of the said polypeptides. 

In preferred embodiments, the agents identified by the use of these methods are 
monoclonal or polyclonal antibodies. 

20 In addition, these methods can be used to Identify compounds that will induce a desired 
immune response in a subject or patient and thereby serve as valuable tools in the 
development of novel medical compositions as for Instance vaccines. Therefore, In a 
preferred embodiment of the invention, the methods described above are used for 
identifying polypeptides, which will Induce a specific antibody response upon 

25 administration to a subject in need hereof, or nucleotide sequences encoding such amino 
acid sequences. Use of the methods for this purpose comprises injecting into a living 
organism one or more of the polypeptides defined above, contacting a tissue or a biological 
fluid sample from said organism with said polypeptides; allowing an in vitro reaction to 
occur between the polypeptides and antibodies possibly present in the biological tissue; 

30 and the in vitro detection of complexes possibly formed. 

An additional preferred embodiment is a method as described above wherein said tissue or 
said biological fluid sample is contacted with polypeptides expressed on the surface of a 
cell. 

35 

An equally preferred embodiment is a method as described above wherein said tissue or 
said biological fluid sample is contacted with polypeptides expressed on the surface of 
erythrocytes selected for adhesion to endothelial cells or for increased antibody 
recognition. 
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Finally, another preferred embodiment of the invention is a method as described above 
wherein said tissue or biological fluid sample is contacted with polypeptides immobilised on 
a solid support. 

5 

In other embodiments, protein models of the polypeptides of the invention are constructed 
by the use of conventional techniques within molecular biology^ Agents that interact with 
polypeptides of the invention are constructed and approaches in combinatorial chemistry 
are employed in the development of agents that modulate SEQ ID NO.: 2, SEQ ID NO.: 4, 

10 and/or SEQ ID NO, : 6 mediated interaction with endothelial receptors or are able to induce 
an immune response. Accordingly, novel agents that interact with SEQ ID NO.: 2, SEQ ID 
NO.: 4, and/or SEQ ID NO.: 6 are developed, screened in a SEQ ID NO.: 2, SEQ ID NO.: 4, 
and/or SEQ ID NO.: 6 characterisation assay, for instance a SEQ ID NO.: 2, SEQ ID NO.: 
4, and/or SEQ ID NO.: 6 anti-adhesion assay as described above. The identity of each 

15 agent and its performance in the SEQ ID NO.: 2, SEQ ID NO.: 4, and/or SEQ ID NO.: 6 
characterisation assay, its effect on the modulation of SEQ ID NO.: 2, SEQ ID NO.: 4, 
and/or SEQ ID NO.: 6-medlated adhesion to endothelial cells or its ability to Induce an 
immune response Is recorded on electronic or non-electronic media. These recorded data 
can serve as the basis for a library of SEQ ID NO.: 2, SEQ ID NO.: 4, and/or SEQ ID NO.: 

20 6 modulating agents. Such a library can again be employed to further identify agents that 
modulate SEQ ID NO.: 2, SEQ ID NO.: 4, and/or SEQ ID NO.: 6-mediated adhesion to 
endotheMa) cells and can be valuable tools for selecting an appropriate pharmaceutical to 
treat a particular type of infection with Plasmodfum, It is further expected that the high 
throughput screening techniques currently In use within the biotech and pharmaceutical 

25 Industries can readily be applied to the procedures outlined above. 

Host cells 

With respect to the above embodiments, the invention further relates to host cells 
comprising the above-described nucleic acid molecules. The nucleic acid molecules may be 

30 transformed, stably transfected or transiently transfected into the host cell or infected Into 
the host cell by a live attenuated virus. The preferred host cells may include, but are not 
limited to, proicaryotic cells, such as Escherichia coif, Staphylococcus aureus, and 
eukaryotic cells, such as Sacchromyces cerevtsiae and Pichia pastons, CHO and COS cells 
as well as Baculovirus infected hi-five or sf9 insect cells. Transformation with the 

35 recombinant molecules can be effected using methods well known In the art. 

1 



36017DK01 



47 

It should be understood that any feature and/or aspect discussed above in connection with 
the use according to the invention apply by analogy to methods of treatment or prevention 
of malaria according to the invention. 

5 It should be understood that in a particular preferred embodiment of all the ascpects of the 
present invention, the malaria is severe malaria. 

As will be apparent, preferred features and characteristics of one aspect of the invention 
may be applicable to other aspects of the invention. 

10 

Throughout this specification the word "comprise", or variations such as "comprises" or 
"comprising", will be understood to imply the inclusion of a stated element, integer or step, 
or group of elements, integers or steps, but not the exclusion of any other element, 
integer or step, or group of elements, integers or steps, 

15 

All patent and non-patent references cited in the present application, are hereby 
incorporated by reference in their entirety. 

The invention will hereinafter be described by way of the following non-limiting Definitions, 
20 Figures and Examples. 

Definitions 

'Vaccine'' refers to a preparation of SEQ ID NO.: 1, and/or SEQ ID NO.: 2, and/or SEQ ID 
25 NO.: 3, SEQ and/or ID NO.: 4, and/or SEQ ID NO.: 5, and/or SEQ ID NO.: and/or 6, which 
can Induce protective immunity against severe malaria, but which does not itself cause 
disease. 

**Medicament" reiates to any composition comprising any of the polypeptides and/or nucleic 
30 acids describe herein for treatment of malaria and/or preventition of Initiation of malaria 
and/or prophylaxis of malaria infection, 

VSA' refers to variant surface antigens expressed on the surface of RBC infected by 
Plasmodium falciparum. In the present context the variant surface antigen is PfEMPl. 

35 

•Serological phenotype* refers to the antibody profile obtained by FACS analysis of RBC 
infected by R falciparum expressing VSA on the surface of said RBC. 

3D7 refers to a specific laboratory isolate of a Plasmodium falciparum 3D7, which is a long- 
40 term clone derived from P. falciparum NF54 isolated from a Dutch malaria patient 
(Delemarre and Van der Kaay, 1979). 
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Unless otherwise defined herein or below in the remainder of the specification, ail technical 
and scientific terms used herein have the same meaning as commonly understood by 
those of ordinary skill in the art to which the Invention belongs. 

5 'SEQ ID NO.: 1' is defined as the sequence of the two identical genes with the PlasmoDB 
accession numbers PFD1235W and MAL7P1.1 { http : //www. plasmodb. org ) and the NCBI 
accession number NC 004318 . 

'SEQ ID NO.: T is defined as the protein product of the two identical genes with the 
10 PlasmoDB accession numbers PFD1235w and MAL7P1.1 ( http://www.plasmodb.orQ ) and 
the NCBI accession number NC 004318 - 

The term •VAR4^ is defined as SEQ ID NO.: 2. 

15 'SEQ ID NO.: 3' is defined as the sequence of the gene with the PlasmoDB accession 
number PF11_0008 ( http://www.plasmodb.orQ ) and the NCBI accession number 
NC 004315 . 

•SEQ ID NO.: 4* Is defined as the protein product of the gene with PlasmoDB accession 
number PF11_0008 ( http://www.plasmodb,ora ) and the NCBI accession number 
20 NC 004315 . 

The term 'VAR5' is defined as SEQ ID NO.: 4. 

'SEQ ID NO.: 5' is defined as the sequence of the gene with the PlasmoDB accession 
25 number PF13_0003 ( httD://www. plasmodb. org ) and the NCBI accession number 
NC 004331 . 

•SEQ ID NO.: 6' Is defined as the protein product of the gene with the PlasmoDB accession 
number PF13_0003 f httD://www.plasmodb.orq ) and the NCBI accession number 
NC 004331 . 

30 

The term VAR6* is defined as SEQ ID NO.: 6. 

The therm 'Var" 1 is defined as the DNA and amino acid sequence identified by the 
PlasmoDB accession numbers PFE1640w ( http://ww w.plasmodb.ora) and the NCBI 
35 accession number NC 004326 and homologous with an identity of 80%. 

The term "var2" is defined as the DNA and amino add sequences identified by the 
PlasmoDB accession numbers PL0030c ( http : //ww w. plasm odb . org ) and the NCBI accession 
number NC 004316 and homologous with an Identity of 80%. 

40 

The term 'Var3" Is defined as three DNA and amino acid sequences identified by the 
PlasmoDB accession numbers PFAOOlSc, MAL6P1.314, and PFI1820w 
fhtto://www.plasmodb.orQ > and the NCBI accession numbers NC QQ4325, NC 004327. 
and NC 00433Q respectively and homologous with an identity of 80%. 
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The term '"Exon 2" refers to the nucleotides no, 9444-10662 of SEQ ID NO.: 1, the 
nucleotides no. 7722-8985 of SEQ ID NO.: 3, the nucleotides no. 8847-10041 of SEQ ID 
NO.: 5, amino add no.3148-3553 of SEQ ID NO.: 2, amino acid no. 2574-2994 of SEQ ID 
5 NO.: 4, and amino acid no. 2949-3346 of SEQ ID NO.: 6, 

The term "Fragment 1" of SEQ ID NO. 1, SEQ ID NO.: 3, and SEQ ID NO.: 5 refers to 
nucleotides no. 1-300 and amino acid no. 1-100 of SEQ ID NO.: 2, SEQ ID NO.: 4, and 
SEQ ID NO.: 6. 

10 

The term ^Tragment 2" of SEQ ID NO. 1, SEQ ID NO.: 3, and SEQ ID NO,: 5 refers to 
nucleotides no. 301-600 and amino acid no. 101-200 of SEQ ID NO.: 2, SEQ ID NO.: 4, 
and SEQ ID NO.: 6. 

15 The term "Fragment 3" of SEQ ID NO. 1, SEQ ID NO,: 3, and SEQ ID NO.: 5 refers to 
nucleotides no. 601-900 and amino acid no. 201-300 of SEQ ID NO.: 2, SEQ ID NO.: 4, 
and SEQ ID NO.: 6. 

The term Fragment 4" of SEQ ID NO.: 1, SEQ ID NO.: 3, and SEQ ID NO.: 5 refers to 
20 nucleotides no. 901-1200 and amino acid no. 301-400 of SEQ ID NO.: 2, SEQ ID NO.: 4, 
and SEQ ID NO.: 6. 

The term "Fragment 5" of SEQ ID NO. 1, SEQ ID NO.: 3, and SEQ ID NO.: 5 refers to 
nucleotides no. 1201-1500 and amino acid no. 401-500 of SEQ ID NO.: 2, SEQ ID NO.: 4, 
25 and SEQ ID NO.: 6. 

The term "Fragment 6" of SEQ ID NO. 1, SEQ ID NO.: 3, and SEQ ID NO.: 5 refers to 
nucleotides no. 1501-1800 and amino acid no. 501-600 of SEQ ID NO.: 2, SEQ ID NO.: 4, 
and SEQ ID NO.: 6. 

30 

The term "Fragment 7" of SEQ ID NO, 1, 3, and 5 refers to nucleotides no. 1801-2100 and 
amino acid no. 601-700 of SEQ ID NO.: 2, SEQ ID NO.: 4, and SEQ ID NO.: 6. 

The term "Fragment 8" of SEQ ID NO. 1, 3, and 5 refers to nucleotides no. 2101-2400 and 
35 amino acid no. 701-800 of SEQ ID NO.: 2, SEQ ID NO.: 4, and SEQ ID NO.: 6. 

The term "Fragment 9" of SEQ ID NO. 1, 3, and 5 refers to nucleotides no. 2401-2700 and 
amino acid no. 801-900 of SEQ ID NO.: 2, SEQ ID NO.: 4, and SEQ ID NO.: 6. 

40 The term "Fragment 10" of SEQ ID NO. 1, 3, and 5 refers to nucleotides no. 2701-3000 
and amino acid no. 901-1000 of SEQ ID NO.: 2, SEQ ID NO.: 4, and SEQ ID NO.; 6. 
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The term ^Tragment 11" of SEQ ID NO. 1, SEQ ID NO.: 3, and SEQ ID NO.: 5 refers to 
nucleotides no. 3001-3300 and amino acid no, 1001-1100 of SEQ ID NO.: % SEQ ID NO,: 
4, and SEQ ID NO.: 6. 

5 

The term 'Tragment 12" of SEQ ID NO. 1, 3 SEQ ID NO.:, and SEQ ID NO.: 5 refers to 
nucleotides no. 3301-3600 and amino add no. 1101-1200 of SEQ ID NO.: 2, SEQ ID NO.: 
4, and SEQ ID NO.: 6. 

10 The term "Fragment 13" of SEQ ID NO. 1, SEQ ID NO.: 3, and SEQ ID NO.: 5 refers to 
nucleotides no. 3601-3900 and amino add no. 1201-1300 of SEQ ID NO.: 2, SEQ ID NO.: 
4, and SEQ ID NO.: 6. 

The term "Fragment 14" of SEQ ID NO. 1, SEQ ID NO.: 3, and SEQ ID NO.: 5 refers to 
15 nucleotides no. 3901-4200 and amino acid no. 1301-1400 of SEQ ID NO.: 2, SEQ ID NO.: 
4, and SEQ ID NO.: 6. 

The term "Fragment 15" of SEQ ID NO. 1, SEQ ID NO.: 3, and SEQ ID NO.: 5 refers to 
nucleotides no. 4201-4500 and amino acid no. 1401-1500 of SEQ ID NO.: 2, SEQ ID NO.: 
20 4, and SEQ ID NO.: 6. 

The term "Fragment 16" of SEQ ID NO. 1, SEQ ID NO.: 3, and SEQ ID NO.: 5 refers to 
nucleotides no, 4501-4800 and amino add no. 1501-1600 of SEQ ID NO.: 2, SEQ ID NO. 
4, and SEQ ID NO.: 6. 

25 

The term "Fragment 17" of SEQ ID NO. 1, SEQ ID NO,: 3, and SEQ ID NO.: 5 refers to 
nucleotides no. 4801-5100 and amino acid no. 1601-1700 of SEQ ID NO.: 1, SEQ ID NO. 
4, and SEQ ID NO.: 6. 

30 The term "Fragment 18" of SEQ ID NO. 1, SEQ ID NO.: 3, and SEQ ID NO.: 5 refers to 
nucleotides no. 5101-5400 and amino acid no. 1701-1800 of SEQ ID NO.: 2, SEQ ID NO. 
4, and SEQ ID NO.: 6. 

The term "Fragment 19" of SEQ ID NO. 1, SEQ ID NO.: 3, and SEQ ID NO.: 5 refers to 
35 nucleotides no. 5401-5700 and amino acid no. 1801-1900 of SEQ ID NO-: 2, SEQ ID NO. 
4, and SEQ ID NO.: 6. 

The term "Fragment 20" of SEQ ID NO. 1, SEQ ID NO.: 3, and SEQ ID NO.: 5 refers to 
nucleotides no. 5701-6000 and amino acid no. 1901-2000 of SEQ ID NO.: 2, SEQ ID NO. 
40 4, and SEQ ID NO.: 6. 
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The term "Fragment 21" of 5EQ ID NO. 1, SEQ ID NO.: 3, and SEQ ID NO.: 5 refers to 
nucleotides no. 6001-6300 and amino add no. 2001-2100 of SEQ ID NO.: 2, SEQ ID NO.: 
4, and SEQ ID NO*: 6. 

5 

The term 'Tragment 22" of SEQ ID NO. 1, SEQ ID NO.: 3, and SEQ ID NO.: 5 refers to 
nucleotides no. 6301-6600 and amino acid no. 2101-2200 of SEQ ID NO.: 2, SEQ ID NO.: 
4, and SEQ ID NO.: 6. 

10 Tlie term "Fragment 23" of SEQ ID NO. I, 3, and 5 refers to nucleotides no. 6601-6900 
and amino acid no. 2201-2300 of SEQ ID NO.: 2, SEQ ID NO.: 4, and SEQ ID NO.: 6. 

The term "Fragment 24" of SEQ ID NO. 1, SEQ ID NO.: 3, and 5 SEQ ID NO.: refers to 
nucleotides no. 6901-7200 and amino acid no. 2301-2400 of SEQ ID NO.: 2, SEQ ID NO.. 
15 4, and SEQ ID NO-: . 6. 

The term "Fragment 25" of SEQ ID NO. 1, 3, and 5 refers to nucleotides no. 7201-7500 
and amino acid no. 2401-2500 of SEQ ID NO.: 2, SEQ ID NO.: 4, and SEQ ID NO.: 6. 

20 The term "Fragment 26" of SEQ ID NO. 1 and SEQ ID NO.: 5 refers to nucleotides no. 
7501-7800 and amino acid no. 2501-2600 of SEQ ID NO.: 2 and SEQ ID NO.: 6. 

The term "Fragment 26" of SEQ ID NO. 3 refers to nucleotides no. 7501-7719 and amino 
add no. 2501-2573 of SEQ ID NO.: 4. 

25 

The term 'fragment 27" of SEQ ID NO. 1 and SEQ ID NO.: 5 refers to nucleotides no, 
7801-8100 and amino add no. 2601-2700 of SEQ ID NO.: 2 and 6. 

The term "Fragment 28" of SEQ ID NO. 1 and SEQ ID NO.: 5 refers to nucleotides no. 
30 8101-8400 and amino acid no. 2701-2800 of SEQ ID NO.: 2 and SEQ ID NO.: 6. 

The term "Fragment 29" of SEQ ID NO. 1 and SEQ ID NO.: 5 refers to nucleotides no. 
8401-8700 and amino add no. 2801-2900 of SEQ ID NO.: 2 and SEQ ID NO.: 6. 

35 The term "Fragment 30" of SEQ ID NO. 1 refers to nucleotides no. 8701-9000 and amino 
acid no. 2901-3000 of SEQ ID NO.: 2. 

The term "Fragment 30" of SEQ ID NO, 5 refers to nucleotides no. 8701-8844 and amino 
acid no. 2901-2948 of SEQ ID NO.: 6. 

40 
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The term "Fragment 31" of SEQ ID NO. 1 refers to nucleotides no, 9001-9300 and amino 
add no. 3001-3100 of SEQ ID NO.: 2. 

The term 'Tragment 32" of SEQ ID NO. 1 refers to nucleotides no. 9301-9441 and amino 
5 acid no. 3101-3147 of SEQ ID NO.: 2. 

The term 'adhesion to endothelial cells' or 'cytoadhesion' refers to the ability of 
erythrocytes infected by P. falciparum to adhere (bind) to surfaces (ptastic or tissues), 
where endothelial cells are available for specific Interaction with variant surface antigens 
10 expressed on the surface of the infected erythrocytes. The capacity of a given parasite 
isolate/line/clone for adhesion to endothelial cells in vitro is defined as the proportion of 
parasitised erythrocytes that can withstand washing after having been allowed to adhere 
(bind) to endothelial cells, 

15 A "polypeptide" (e.g., a protein, polypeptide, peptide, etc.) is a polymer of amino acids 
comprising naturally occurring amino acids or artificial amino acid analogues, or a 
character string representing an amino acid pofymer, depending on context. Given the 
degeneracy of the genetic code, one or more nucleic acids, or the complementary nucleic 
acids thereof, that encode a specific polypeptide sequence can be determined from the 

20 polypeptide sequence, 

A "polynucleotide" (e.g., a nucleic acid, polynucleotide, oligonucleotide, etc.) is a polymer 
of nucleotides comprising nucleotides A,C,T,U,G, or other naturalJy occurring nucleotides or 
artificial nucleotide analogues, or a character string representing a nucleic acid, depending 
25 on context. Either the given nucleic acid or the complementary nucleic acid can be 
determined from any specified polynucleotide sequence. 

Numbering of a given amino acid potymer or nucleotide polymer "corresponds to" or Is 
"relative to" the numbering of a selected amino acid polymer or nucleic acid potymer when 

30 the position of any given potymer component (e.g., amino acid, nucleotide, also referred to 
generically as a 'Vesidue") is designated by reference to the same or an equivalent position 
in the selected amino acid or nucleotide polymer, rather than by the actual numerical 
position of the component in the given polymer. Thus, for example, the numbering of a 
given amino acid position in a given polypeptide sequence corresponds to the same or 

35 equivalent amino acid position fn a selected polypeptide sequence used as a reference 
sequence. 

An ''equivalent position" (for example, an ''equivalent amino acid position" or "equivalent 
residue position") is defined herein as a position (such as, an amino acid position or a 
40 residue position) of a test polypeptide sequence which aligns with a corresponding position 
of a reference polypeptide sequence, using for example an alignment algorithm as 
described herein such as, for example, the CLUSTALW alignment program using default 
parameters. The equivalent amino acid position of the test polypeptide sequence need not 
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have the same numerical position number as the corresponding position of the test 
polypeptide. 

A "variant" is a polypeptide comprising a sequence, which differs (by deletion of an ammo 
5 acid, insertion of an amino acid, and/or substitution of an amino acid for a different amino 
acid) in one or more amino acid positions from that of a parent polypeptide sequence. The 
variant sequence may be a non-naturally occurring sequence, I.e., a sequence not found in 
nature. 

10 ^'Naturally occurring" as applied to an object refers to the fact that the object can be found 
in nature as distinct from being artificially produced by man. For example, a polypeptide or 
polynucleotide sequence that is present in an organism (including viruses, bacteria, 
protozoa, insects, plants or mammalian tissue) that can be isolated from a source in 
nature and which has not been intentionally modified by man in the laboratory is naturally 

15 occurring. "Non-naturally occurring" as applied to an object means that the object is not 
naturally-occurring — i.e., the object cannot be found in nature as distinct from being 
artificially produced by man. 

20 "Binding" between two molecules, e,g., a ligand and a receptor, means a preferential 
binding of one molecule for another in a mixture of molecules. The binding of the 
molecules is typically considered specific If the binding affinity is about 1 x 10^ M"^ to about 
1 X 10^ M"* or greater (i.e., Kp of about 10"^ to 10"^ M or less). Binding affinity of a ligand 
and a receptor may be measured by standard techniques known to those of skill in the art. 

25 Non-limiting examples of well-known techniques for measuring binding affinities include 
Biacore® technology (Blacore AB, Sweden), isothermal titration microcalorimetry (MicroCat 
LLC, Northampton, MA USA), ELISA, and FACS. For example, FACS or other sorting 
methods may be used to select for populations of molecules (such as for example, ceH 
surface-displayed ligands) which specifically bind to the associated binding pair member 

30 (such as a receptor, e.g., a soluble receptor). Ligand-receptor complexes may be detected 
and sorted e.g., by fluorescence (e.g., by reacting the complex with a fluorescent antibody 
that recognizes the complex). Molecules of interest which bind an associated binding pair 
member (e.g., receptor) are pooled and re-sorted in the presence of lower concentrations 
of receptor. By performing multiple rounds sorting in the presence of decreasing 

35 concentrations of receptor (an exemplary concentration range being on the order of 10'^ M 
down to 10"® M, I.e., 1 micromolar (pM) down to 1 nanomolar (nM), or less, depending on 
the nature of the ligand-receptor interaction), populations of the molecule of interest 
exhibiting specific binding affinity for the receptor may be isolated. 

40 A polypeptide, nucleic add, or other component is "isolated" when it is partially or 
completely separated from components with which it is normally associated (other 
peptides, polypeptides, proteins (including complexes, e.g., polymerases and ribosomes 
which may accompany a native sequence), nucleic acids, cells, synthetic reagents, cellular 
contaminants, cellular components, etc.)/ e.g., such as from other components with which 
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It is normally associated In the cell from which it was originally derived. A polypeptide, 
nucleic acid, or other component is isolated when it is partially or completely recovered or 
separated from other components of its natural environment such that it is the 
predominant species present In a composition, mixture, or collection of components {i.e., 
5 on a molar basis it is more abundant than any other individual species in the composition). 
In some instances, the preparation consists of more than about 60%, 70% or 75%, 
typically more than about 80%^ or preferably more than about 90% of the isolated 
species. 

10 In one aspect, a ^^substantially pure" or "isolated" nucleic acid (e.g., RNA or DNA), 

polypeptide, protein^ or composition also means where the object species (e.g,, nucleic 
acid or polypeptide) comprises at least about 50, 60, or 70 percent by weight (on a molar 
basis) of all macromolecular species present. A substantially pure or isolated composition 
can also comprise at least about 80^ 90, or 95 percent by weight of all macromolecular 

15 species present In the composition. An isolated object species can also be purified to 
essential homogeneity (contaminant species cannot be detected in the composition by 
conventional detection methods) wherein the composition consists essentially of 
derivatives of a single macromolecular species. The term "purified" generally denotes that 
a nucleic acid, polypeptide, or protein gives rise to essentially one band In an 

20 electrophoretic gel. It typically means that the nucleic acid, polypeptide, or protein is at 
least about 50% pure, 60% pure, 70% pure, 75% pure, more preferably at least about 
85% pure, and most preferably at least about 99% pure. 

The term "isolated nucleic acid" may refer to a nucleic acid (e.g., DNA or RNA) that is not 
25 immediately contiguous with both of the coding sequences with which it is immediately 
contiguous (i.e., one at the 5' and one at the 3' end) in the naturally occurring genome of 
the organism from which the nucleic acid of the invention is derived. Thus, this term 
includes, e.g., a cDNA or a genomic DNA fragment produced by polymerase chain reaction 
(PGR) or restriction endonuclease treatment, whether such cDNA or genomic DNA 
30 fragment is incorporated into a vector, integrated into the genome of the same or a 
different species than the organism, including, e.g., a virus, from which it was originally 
derived, linked to an additional coding sequence to form a hybrid gene encoding a chimeric 
polypeptide, or independent of any other DNA sequences. The DNA may be double- 
stranded or single-stranded, sense or anti-sense. 

35 

A "recombinant polynucleotide" or a 'Yecombinant polypeptide" is a non-naturally occurring 
polynucleotide or polypeptide that includes nucleic acid or amino acid sequences, 
respectively, from more than one source nucleic acid or polypeptide, which source nucleic 
acid or polypeptide can be a naturally occurring nucleic acid or polypeptide, or can itself 
40 have been subjected to mutagenesis or other type of modification. A nucleic acid or 

polypeptide may be deemed "recombinant" when it is artificial or engineered, or derived 
from an artificial or engineered polypeptide or nucleic acid. A recombinant nucleic acid 
(e.g., DNA or RNA) can be made by the combination (e.g., artificial combination) of at 
least two segments of sequence that are not typically included together, not typically 
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associated with one another, or are otherwise typically separated from one another, A 
recombinant nucleic acid can comprise a nucleic acid molecule formed by the joining 
together or combination of nucleic acid segments from different sources and/or artificially 
synthesized. A 'Vecombinant polypeptide" (or 'Yecombinant protein") often refers to a 
5 polypeptide (or protein) that results from a cloned or recombinant nucleic acid or gene. 
The source polynucleotides or polypeptides from which the different nucleic acid or amino 
acid sequences are derived are sometimes homologous (i.e., have, or encode a 
polypeptide that encodes, the same or a similar structure and/or function), and are often 
from different isolates, serotypes, strains, species, of organism or from different disease 
10 states, for example. 

The term "recombinant" when used with reference, e,g., to a cell, nucleotide, vector, 
protein, or polypeptide typically Indicates that the cell, nucleotide, or vector has been 
modified by the introduction of a heterologous (or foreign) nucleic add or the alteration of 

15 a native nucleic acid, or that the protein or polypeptide has been modified by the 
introduction of a heterologous amino acid, or that the cell is derived from a cell so 
modified. Recombinant cells express nucleic acid sequences (e.g., genes) that are not 
found in the native (non-recombinant) form of the cell or express native nucleic acid 
sequences (e.g., genes) that would be abnormally expressed under-expressed, or not 

20 expressed at all. The term "Yecombinant" when used with reference to a cell indicates that 
the cell replicates a heterologous nucleic acid, or expresses a peptide or protein encoded 
by a heterologous nucleic acid. Recombinant cells can contain genes that are not found 
within the native (non-recombinant) form of the cell. Recombinant cells can also contain 
genes found in the native form of the cell wherein the genes are modified and re- 

25 introduced into the cell by artificial means. The term also encompasses cells that contain a 
nucleic acid endogenous to the cell that has been modified without removing the nucleic 
acid from the cell; such modifications include those obtained by gene replacement, site- 
specific mutation, and related techniques. 

30 The term "recombinantly produced" refers to an artificial combination usually accomplished 
by either chemical synthesis means, recursive sequence recombination of nucleic acid 
segments or other diversity generation methods (such as, e.g., shuffling) of nucleotides, or 
manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 
known to those of ordinary skill in the art. "'Recombinantly expressed" typically refers to 

35 techniques for the production of a recombinant nucleic acid in vitro and transfer of the 
recombinant nucleic acid into cells in vivo, in vitro, or ex vivo where it may be expressed 
or propagated. 

The term "upregulated" in the aspects of the present invention refers to detection of a 
40 transcript by real-time quantitative PGR of any of the malaria parasite nucleotides of the 
present invention, wherein the nucleotide transcription level is evaluated, when compared 
to a housekeeping gene such as but not limited seryl-tRNA-transferase. When a 
transcription level of less than 100 times than that of the housekeeping gene, the 
evaluation is excluded. Any transcription level above this, wherein there is a difference of 
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at least 2 times between the transcription level of the malaria parasite var gene m the 
parasite culture of interest eg the antibody and/or endothelial cell selected 3D7 (3D7sMif 
3D7endo) culture as compared to the control parasite culture eg the 3D7 parasite culture, 
said gene is upregulated.The assay of the present application is used for reference. 

5 

An 'Mmmunogen" refers to a substance capable of provoking an immune response, and 
includes, e.g., antigens^ autoantigens that play a role in induction of autoimmune diseases, 
and tumor-associated antigens expressed on cancer cells. An immune response generally 
refers to the development of a cellular or antibody-mediated response to an agent, such as 
10 an antigen or fragment thereof or nucleic acid encoding such agent. In some Instances, 
such a response comprises a production of at feast one or a combination of CTLs, B cells, 
or various classes of T cells that are directed specifically to antigen-presenting cells 
expressing the antigen of interest. 

15 An ^antigen" refers to a substance that is capable of eliciting the formation of antibodies in 
a host or generating a specific population of lymphocytes reactive with that substance. 
Antigens are typically macromolecules {e.g., proteins and polysaccharides) that are foreign 
to the host. 

20 An '"adjuvant" refers to a substance that enhances an antigen's immune-stimulating 
properties or the pharmacological effect(5) of a drug. An adjuvant may non-specifically 
enhance the immune response to an antigen. "Treund's Complete Adjuvant," for example, 
is an emulsion of oil and water containing an immunogen, an emulsifying agent and 
mycobacteria. Another example, "Freund's incomplete adjuvant," is the same, but without 

25 mycobacteria. 

An "immunogenic composition'' refers to a composition that will evoke an immune 
response when administered to a subject possessing an immune system. 

30 A "vector" is a component or composition for facilitating cell transduction or transfection by 
a selected nucleic acid, or expression of the nucleic acid in the cell. Vectors include, e.g., 
phages, plasmids, cosmids, viruses, YACs, bacteria, poly-Iyslne, etc. An "expression 
vector" is a nucleic acid construct or sequence, generated recombinantly or synthetically, 
with a series of specific nucleic acid elements that permit transcription of a particular 

35 nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or nucleic 
acid fragment. The expression vector typically includes a nucleic acid to be transcribed 
operably linked to a promoter. The nucleic acid to be transcribed is typically under the 
direction or control of the promoter. 

40 The term ''immunoassay" Includes an assay that uses an antibody or immunogen to bind 
or specifically bind an antigen. The immunoassay is typically characterized by the use of 
specific binding properties of a particular antibody to isolate, target, and/or quantify the 
antigen. 
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The term "homology" generally refers to the degree of similarity between two or more 
structures. The term ''homofogous sequences" refers to regions In macromolecules that 
have a similar order of monomers. When used in relation to nucleic add sequences, the 
term ''homology" refers to the degree of similarity between two or more nucleic acid 
5 sequences (e.g., genes) or fragments thereof. Typically, the degree of similarity between 
two or more nucleic acid sequences refers to the degree of similarity of the composition, 
order, or arrangement of two or more nucleotide bases (or other genotypic feature) of the 
two or more nucleic acid sequences. The term "homologous nucleic acids" generally refers 
to nucleic acids comprising nucleotide sequences having a degree of similarity in nucleotide 

10 base composition, arrangement, or order. The two or more nucleic acids may be of the 
same or different species or group. The term '^percent homology" when used in relation to 
nucleic acid sequences, refers generally to a percent degree of similarity between the 
nucleotide sequences of two or more nucleic acids. When used in relation to polypeptide 
(or protein) sequences, the term '^homology" refers to the degree of similarity between two 

15 or more polypeptide (or protein) sequences (e.g., genes) or fragments thereof. Typically, 
the degree of similarity between two or more polypeptide (or protein) sequences refers to 
the degree of similarity of the composition, order, or arrangement of two or more amino 
acid of the two or more polypeptides (or proteins). The two or more polypeptides (or 
proteins) may be of the same or different species or group. The term "percent homology" 

20 when used in relation to polypeptide (or protein) sequences, refers generally to a percent 
cJegree of similarity between the amino acid sequences of two or more polypeptide (or 
protein) sequences. 

The term "homologous polypeptides" or "homologous proteins" generally refers to 
25 polypeptides or proteins, respectively, that have amino acid sequences and functions that 
are similar. Such homologous polypeptides or proteins may be related by having amino 
acid sequences and functions that are similar, but are derived or evolved from different or 
the same species using the techniques described herein. 

30 The term "'subject" as used herein includes, but is not limited to, an organism; a mammal, 
including, e.g., a human, non-human primate (e.g., baboon, orangutan, monkey), mouse, 
pig, cow, goat, cat, rabbit, rat, guinea pig, hamster, horse, monkey, sheep, or other non- 
human mammal; a non-mammal, including, e.g., a non-mammalian vertebrate, such as a 
bird (e,g., a chicken or duck) or a fish, and a non-mammalian invertebrate. 

35 

The term "pharmaceutical composition" means a composition suitable for pharmaceutical 
use in a subject, including an animal or human, A pharmaceutical composition generally 
comprises an effective amount of an active agent and a carrier, including, e.g., a 
pharmaceutically acceptable carrier. ^ 

40 

The term "effective amount" means a dosage or amount sufficient to produce a desired 
result. The desired result may comprise an objective or subjective Improvement In the 
recipient of the dosage or amount. 



36017DK01 



58 

A "prophylactic treatment" is a treatment administered to a subject who does not display 
signs or symptoms of a disease, pathology, or medical disorder, or displays only early 
signs or symptoms of a disease, pathology, or disorder, such that treatment is 
administered for the purpose of diminishing, preventing, or decreasing the risk of 
5 developing the disease, pathology, or medical disorder. A prophylactic treatment functions 
as a preventative treatment against a disease or disorder. A "^prophylactic activity" is an 
activity of an agent, such as a nucleic acid, vector, gene, polypeptide, protein, substance, 
or composition thereof that, when administered to a subject who does not display signs or 
symptoms of pathology, disease or disorder, or who displays only early signs or symptoms 
10 of pathology, disease, or disorder, diminishes, prevents, or decreases the risk of the 
subject developing a pathology, disease, or disorder. 

A ''prophylactically useful" agent or compound (e.g., nucleic acid or polypeptide) refers to 
an agent or compound that Is useful in diminishing, preventing, treating, or decreasing 
15 development of pathology, disease or disorder. 

A ''therapeutic treatment" is a treatment administered to a subject who displays symptoms 
or signs of pathology, disease, or disorder. In which treatment Is administered to the 
subject for the purpose of diminishing or eliminating those signs or symptoms of 

20 pathology, disease, or disorder. A "therapeutic activity" is an activity of an agent, such as 
a nucleic acid, vector, gene, polypeptide, protein, substance, or composition thereof, that 
eliminates or diminishes signs or symptoms of pathology, disease or disorder, when 
administered to a subject suffering from such signs or symptoms. A "therapeutically 
useful" agent or compound {e.g„ nucleic acid or polypeptide) indicates that an agent or 

25 compound is useful in diminishing, treating, or eliminating such signs or symptoms of a 
pathology, disease or disorder. 

The term "'gene" broadly refers to any segment of DNA associated with a biological 
function. Genes include coding sequences and/or regulatory sequences required for their 
30 expression. Genes also Include non-expressed DNA nucleic acid segments that, e.g., form 
recognition sequences for other proteins (e.g., promoter, enhancer, or other regulatory 
regions). Genes can be obtained from a variety of sources, including cloning from a source 
of interest or synthesizing from Icnown or predicted sequence information, and may include 
sequences designed to have desired parameters. 

35 

Generally, the nomenclature used hereafter and the laboratory procedures in cell culture, 
molecular genetics, molecular biology, nucleic acid chemistry, and protein chemistry 
described below are those well known and commonly employed by those of ordinary skill In 
the art. Standard techniques, such as described in Sambrook et al.. Molecular Cloning - A 
40 Laboratory Manual (2nd Ed.), Vols. 1-3, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, New York, 1989 (hereinafter ^'Sambrook") and Current Protocols in Molecular 
Biology, F. M. Ausubel et ai., eds.. Current Protocols, a joint venture between Greene 
Publishing Associates, Inc. and John Wiley & Sons, Inc. (1994, supplemented through 
1999) (hereinafter "Ausubel"). are used for recombinant nucleic acid methods, nucleic acid 
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synthesis, cell culture methods, and transgene Incorporation, e.g., electroporation, 
injection, gene gun, impressing through the skin, and lipofection. Generally, 
oligonucleotide synthesis and purification steps are performed according to specifications. 
The techniques and procedures are generally performed according to conventional 
5 methods in the art and various general references, which are provided throughout this 
document. The procedures therein are believed to be well known to those of ordinary skill 
in the art and are provided for the convenience of the reader. 

As used herein, an "antibody, Ab" refers to a protein comprising one or more polypeptides 
10 substantially or partially encoded by immunoglobulin genes or fragments of 

immunoglobulin genes. The term antibody is used to mean whole antibodies and binding 
fragments thereof. The recognized Immunoglobulin genes include the kappa, lambda, 
alpha, gamma, delta, epsilon and mu constant region genes, as well as myriad 
immunoglobulin variable region genes. Light chains are classified as either kappa or 
15 lambda. Heavy chains are classified as gamma, mu, alpha, delta, or epsilon, which in turn 
define the immunoglobulin classes, IgG (1-4), IgM, IgA, IgD and IgE, respectively. A 
typical immunoglobulin (e.g., antibody) structural unit comprises a tetramer. Each 
tetramer is composed of two identical pairs of polypeptide chains, each pair having one 
"light" (about 25 KDa) and one "heavy" chain (about 50-70 KDa). The N-terminus of each 
20 chain defines a variable region of about 100 to 110 or more amino acids primarily 

responsible for antigen recognition. The terms variable light chain (VL) and variable heavy 
chain (VH) refer to these light and heavy chains, respectively. 

Antibodies exist as intact immunoglobulins or as a number of welhcharacterized fragments 
25 produced by digestion with various peptidases. Thus, for example, pepsin digests an 

antibody below the disulfide linkages In the hinge region to produce F(ab)'2, a dimer of Fab 
which itself is a light chain joined to VH-CHl by a disulfide bond. The F(ab)'2 may be 
reduced under mild conditions to break the disulfide linkage In the hinge region thereby 
converting the (Fab')2 dimer into an Fab' monomer. The Fab' monomer is essentially an 
30 Fab with part of the hinge region. The Fc portion of the antibody molecule corresponds 
largely to the constant region of the immunoglobulin heavy chain, and Is responsible for 
the antibody's effector function (see. Fundamental Immunology, W.E. Paul, ed.. Raven 
Press, N.Y. (1993), for a more detailed description of other antibody fragments). While 
various antibody fragments are defined in terms of the digestion of an Intact antibody, one 
35 of skill will appreciate that such Fab' fragments may be synthesized de novo either 

chemically or by utilizing recombinant DNA methodology. Thus, the term antibody, as used 
herein also includes antibody fragments either produced by the modification of whole 
antibodies or synthesized de novo using recombinant DNA methodologies. 

40 Antibodies also include single-armed composite monoclonal antibodies, single chain 
antibodies, including single chain Fv (sFv) antibodies in which a variable heavy and a 
variable light chain are joined together (directly or through a peptide linker) to form a 
continuous polypeptide, as well as diabodies, tribodies, and tetrabodies (Pack et al. (1995) 
J Mol Biol 246:28; Biotechnol 11:1271; and Biochemistry 31:1579). The antibodies are. 
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e.g., polyclonal, monoclonal, chimeric, humanized, single chain, Fab fragments, fragments 
produced by an Fab expression library, or the like. 

The term '"epitope" means a protein determinant capable of specific binding to an antibody. 
5 Epitopes usually consist of chemically active surface groupings of molecules such as amino 
acids or sugar side chains and usually have specific three dimensional structural 
characteristics, as well as specific charge characteristics. Conformational and 
nonconformational epitopes are distinguished in that the binding to the former but not the 
latter is lost in the presence of denaturing solvents. 

10 

An '"antigen-binding fragment" of an antibody is a peptide or polypeptide fragment of the 
antibody that brnds an antigen. An antigen-binding site Is formed by those amino acids of 
the antibody that contribute to, are involved in, or affect the binding of the antigen. See 
Scott, T.A. and Mercer, E.I., Concise Encyclopedia: Biochemistry and l^olecular Biology {de 
15 Gruyter, 3d ed. 1997), and Watson, J.D. et al., Recombinant DNA {26 ed. 1992) 
[hereinafter "Watson, Recombinant DNA"], each of which Is incorporated herein by 
reference in its entirety for all purposes. 

The term ""screening" describes. In general, a process that identifies optimal molecules of 
20 the present invention, such as, e.g., the polypeptides and fragments and variants thereof, 
and related fusion polypeptides and proteins Including the same, and nucleic acids 
encoding all such molecules. Several properties of these respective molecules can be used 
in selection and screening, for example: an ability of a respective molecule to bind a ligand 
or to a receptor, to inhibit cell proliferation, , to alter an immune response, e.g., Induce or 
25 inhibit a desired immune response, in a test system or an in vitro, ex vivo or in vivo 
application. In the case of antigens, several properties of the antigen can be used in 
selection and screening Including antigen expression, folding, stability, immunogenicity 
and presence of epitopes from several related antigens. 

30 ^'Selection" is a form of screening in which identification and physical separation are 

achieved simultaneously by, e.g., expression of a selection marker, which, in some genetic 
circumstances, allows celts expressing the marker to survive while other cells die (or vice 
versa). Screening markers include, for example, luciferase, beta-galactosidase and green 
fluorescent protein, and the like. Selection markers include drug and toxin resistance 

35 genes, and the like. Another mode of selection involves physical sorting based on a 

detectable event, such as binding of a ligand to a receptor, reaction of a substrate with an 
enzyme, or any other physical process which can generate a detectable signal either 
directly (e.g., by utilizing a chromogenic substrate or ligand) or indirectly (e.g., by reacting 
with a chromogenic secondary antibody). Selection by physical sorting can by 

40 accomplished by a variety of methods, such as by FACS in whole cell or microdroplet 
formats. 

An '"exogenous" nucleic acid," "exogenous DNA segment," '^heterologous sequence," or 
"heterologous nucleic acid," as used herein, is one that originates from a source foreign to 
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the particular host cell, or, if from the same source, is modified from its original form. 
Thus, a heterologous gene in a host ceil includes a gene that is endogenous to the 
particular host cell, but has been modified. Modification of a heterologous sequence in the 
applications described herein typically occurs through the use of recursive sequence 
5 recombination. The terms refer to a DNA segment which is foreign or heterologous to the 
cell, or homologous to the cell but in a position within the host cell nucleic acid in which 
the element is not ordinarily found. Exogenous DNA segments are expressed to yield 
exogenous polypeptides. 

10 The term "nucleic acid" refers to deoxyribonucleotldes or nbonucleotides and polymers 
thereof in either single- or double-stranded form. Unless specifically limited, the term 
encompasses nucleic acids containing known analogues of natural nucleotides which have 
similar binding properties as the reference nucleic acid and are metabolized In a manner 
similar to naturally occurring nucleotides. Unless otherwise indicated, a particular nucleic 

15 acid sequence also implicitly encompasses conservatively modified variants thereof (e.g., 
degenerate codon substitutions) and complementary sequences and as well as the 
sequence explicitly indicated. Specifically, degenerate codon substitutions may be achieved 
by generating sequences in which the third position of one or more selected (or all) codons 
\s substituted with mixed-base and/or deoxyinosine residues (Batzer et al. (1991) Nucleic 

20 Add Res 19:5081; Ohtsuka et al. (1985) 3 Biol Chem 260:2605-2608; Cassol et al. (1992) 
; Rossolini et al. (1994) Mol Cell Probes 8:91-98). The term nucleic acid is used 
Interchangeably with gene, cDNA, and mRNA encoded by a gene. 

^^Nucleic acid derived from a gene" refers to a nucleic acid for whose synthesis the gene, or 
25 a subsequence thereof, has ultimately served as a template. Thus, an mRNA, a cDNA 
reverse transcribed from an mRNA, an RNA transcribed from that cDNA, a DNA amplified 
from the cDNA, an RNA transcribed from the amplified DNA, etc., are all derived from the 
gene and detection of such derived products is indicative of the presence and/or 
abundance of the original gene and/or gene transcript in a sample. 

30 

A nucleic acid is ^^operably linked" when It Is placed into a functional relationship with 
another nucleic acid sequence. For Instance, a promoter or enhancer is operably linked to a 
coding sequence if it increases the transcription of the coding sequence. Operably linked 
means that the DNA sequences being linked are typically contiguous and, where necessary 
35 to Join two protein coding regions, contiguous and in reading frame. However, since 

enhancers generally function when separated from the promoter by several kilobases and 
intronic sequences may be of variable lengths, some polynucleotide elements may be 
operably linked but not contiguous. 

40 The term "identical" or "identity," in the context of two or more nucleic acid or polypeptide 
sequences, refers to two or more sequences or subsequences that are the same or have a 
specified percentage of amino add residues or nucleotides that are the same, when 
compared and aligned for maximum correspondence, as measured using one of the 
following sequence comparison algorithms or by visual inspection. 
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"Sequence identity" is a measure of identity between polypeptides at tlie amino acid level 
and a measure of identity between nucleic acids at nucleotide level. The protein sequence 
identity may be determined by comparing the amino acid sequence In a given position in 
5 each sequence when the sequences are aligned. Similarly, the nucleic add sequence 
identity may be determined by comparing the nucleotide sequence in a given position In 
each sequence when the sequences are aligned 

To determine the percent identity of two amino acid sequences or of two nucleic acids^ the 
10 sequences are aligned for optima! comparison purposes (e.g., gaps can be introduced in 
the sequence of a first ammo add or nucleic acid sequence for optimal alignment with a 
second amino or nucleic acid sequence). The amino acid residues or nucleotides at 
corresponding amino acid positions or nucleotide positions are then compared. When a 
position in the first sequence is occupied by the same amino add residue or nudeotlde as 
15 the corresponding position In the second sequence, then the molecules are identical at that 
position. The percent identity between the two sequences is a function of the number of 
identical positions shared by the sequences (i.e., % identity # of identical positions/total # 
of positions (e.g., overlapping positions) x 100). In one embodiment the two sequences 
are the same length. 

20 

Alignment of two sequences for the determination of percent identity is to be accomplished 
by using a mathematical algorithm published (Tatusova and Madden, 1999). BLAST 
nucleotide alignments is be performed with the blastn program, with the parameters 
"Reward for a match" = 1, "Penalty for a mismatch" = -2, "Strand option" = both strands, 
25 "Open gap" = 5, "Extension gap" = 2, "gapx_dropoff' = 50, "expect" = 10.0, "word size" = 
11 and "Filter" = on. 

BLAST protein searches can be performed with the blastp program applying the 
"BLOSUM26" matrix, with the parameters "Reward for a match" = 1, "Penalty for a 
30 mismatch" = -2, "Open gap" 11, "Extension gap" = 1, "gapx^dropofF' ^ 50, "expect" = 
10,0, "word size" = 3 and "Filter" = on. 

Both programs can be accessed from National Center for Biotechnological Information's 
web page at httpr//www. ncbi.nlm.nih.gov/bla5t/bi2seq/bI2.html 

35 

In all polypeptide or amino acid based embodiments of the invention the percentage of 
sequence identity between one or more sequences is based on alignment of the respective 
sequences as performed by dustalW software (http:/www.ebi.ac.uk/clustaIW/index.htmf) 
using the default settings of the program. These settings are as follows: Allgnment=3Dfull, 
40 Gap Open 10.00, Gap Ext. 0.20, Gap separation Dlst. 4, Protein weight matrix: Gonnet- 
With respect to the nudeotide-based embodiments of the invention, the percentage of 
sequence identity between one or more sequences is also based on alignments using the 
clustalW software with default settings. For nucleotide sequence alignments these settings 
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are: Angnment=3Dfull, Gap Open 10.00, Gap Ext. 0.20, Gap separation Dist. 4, DNA 
weight matrix: Identity (lUB). 

Tlie term ''serum" is used in its normal meaning, i.e. as blood plasma without fibrinogen 
5 and otiier clotting factors. The term ^'plasma" is used in its normal meaning, i.e, as blood 
plasma. Both terms are use Interchangeable. 

In the present context "complementary sequence" refers to nucleotide sequences, which 
will hybridise to a nucleic acid molecule of the invention under stringent conditions. The 

10 term "stringent conditions" in refers to general conditions of high stringency. The term 
"stringency" Is well known In the art and Is used in reference to the conditions 
(temperature, ionic strength and the presence of other compounds such as organic 
solvents) under which nucleic actd hybridisations are conducted. With "high stringency" 
conditions, nucleic add base pairing will occur only between nucleic acid fragments that 

15 have a high frequency of complementary base sequences, as compared to conditions of 
"weak" or "low" stringency. 

As an example, high stringency hybridisation conditions comprise (1) low ionic strength 
and high temperature for washing, such as 0,015 M NaCI/0.0015 M sodium citrate, pH 7.0 

20 (O.lxSSC) with 0.1% sodium dodecyl sulfate (5DS) at 50^C; (2) hybridisation in 50% 
(vol/vol) formamide with 5 x Denhardt's solution (0.1% (wt/vol) highly purified bovine 
serum albumin/0.1% (wt/vol) Ficoll/0.1% (wt/vol) polyvinylpyrrolidone), 50 mM sodium 
phosphate buffer at pH 30.5 and 5 x SSC at 42*>C; or (3) hybridisation In 50% formamrde, 
5 X SSC, 50 mM sodium phosphate (pH 30.8), 0.1% sodium pyrophosphate, 5 x 

25 Denhardt's solution, sonicated salmon sperm DNA (50 pg/ml), 0.1% SDS, and 10% 
dextran sulfate at 42^C with washes at 42°C in 0.2 x SSC and 0.1% SDS. 

The term "effective amount" refers to an amount or concentration of a substance such as 
an amino acid sequence, nucleotide sequence or an antibody, which is effective to produce 

30 a protective prophylactic or therapeutic response with respect to the disease malaria. In 
general, an effective amount of the substance, which Is administered to a human subject, 
will vary depending upon a number of factors associated with that subject, including 
whether the subject has previously been exposed to Plasmodium falciparum. The person of 
ordinary skill in the art can determine an effective amount of the substance by varying the 

35 dosage of the product and measuring the resulting cellular and humoral Immune and/or 
therapeutic responses subsequent to administration. In particular, the concentration range 
of an immunogenic substance is chosen so as to enhance the likelihood of eliciting an 
immunogenic response e.g. vaccinating the recipient for a long period of time, without 
causing a malaria infection in the vaccine recipient. 

40 

"Endemic areas" refers to areas where transmission of P. falciparum parasites occurs 
repeatedly over the years. Depending on the intensity of transmission, endemic areas are 
often divided (in order of decreasing intensity) into holo- (Intense, perennial transmission), 
hyper- (intense, seasonal transmission), meso- (less intense, locally and temporally 
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varying transmission), hypo-endemic {little transmission with little effect at the popuiation 
level) areas. 

A "B-cell epitope" is defined as an antigenic determmant, which functionally is the portion 
5 of an antigen, which combines with the antibody paratope. B-cell epitopes are usually 
composed of approximately 30 amino acids and are expected to be located at the surface 
of the protein and surface probability programs and hydrofobicity plots can therefore help 
denning areas with B-cell epitopes. With respect to the present invention the Protean 4.0 
software in the DNAstar package is used with default settings when defining such areas. 
10 Specific B-cell epitopes should preferably be determined experimentally, which can be 
done by methods well known to the person of ordinary skill in the art. 

In the present context the term "DNA vaccine" refers to vaccines based on any species of 
nucleic acid molecules, comprising species of DNA or RNA. 

15 

The term "T cell epitope" refers to a sequence of about ten amino acids that are part of a 
much longer, folded chain of amino acids and can lead to activation of a T-cell when 
presented on the surface of a cell in complex with Major Histocompatibility Complex II 
(MHC) and/or MHCL Probability values for putative T-cell epitopes within a polypeptide 
20 may be obtained with the use of computers, neural networks and prediction servers such 
as SYFPEITHI server at Centre for Biological Sequence Analysis BioCentrum-DTU, Technical 
University of Denmark ( http://syfpeithi.bmi-h pidelberQ.com/ScriDts/MHCServer.dil/. 
EpPredlct.htm ^. which is used with, default unchangeable settings. 

25 An "immune response" refers to any response, which occur in the human body or any 
mammalian species as a reaction to its contact with a foreign substance- An Immune 
response can for example cause activation of B-lymphocytes and/or T^cells. Activation of 
B-lymphocytes can result in production of antibodies that can target said foreign substance 
or antigen. Activation of T-cells can result in production of cytokines or activation of 

30 cytotoxic T-cells, such T-cells can be CD8+ or CD4+ or CD8-/CD4-. Activation of an 
immune response can furthermore result in activation of macrophages, NK cells and/or 
result In the production of specific T- and B- memory cells 

The term 'fusion protein' is to be interpreted as the product of a SEQ ID NO.: 1, SEQ ID 
35 NO.: 3, and/or SEQ ID NO.: 5 nucleic acid sequence to which an exogenous nucleic acid 
sequence that may be of virtually any length has been added. 

"In vitro panning" refers to a procedure by which erythrocytes infected by a particular 
isolate/line/done of P. falciparum is selected for dominant expression of a variant surface 
40 antigen (VSA) with defined adhesion characteristics. To select for expression of VSA that 
can adhere to human endothelial cells in vitro by in vitro panning, erythrocytes infected by 
mature stages of the isolate/line/clone in question are allowed to adhere to culture dishes 
previously containing human endothelial cells. Unbound (non-adhering) erythrocytes are 
rempved by washing, and only the remaining bound (adhering) are used to propagate the 
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Isolate/Hne/cIone further. The process of Jn vitro panning is usually repeated at a minimum 
of three times to ensure uniform expression of the VSA with the desired adhesion 
characteristics, 

5 "Expression systems" refers to eucaryotlc and/or prol<aryotlc systems for expression of 
VAR4, VAR5, and/or VAR6 protein or homologues hereof. The DNA sequence that forms 
the basis for expression in these systems may be either non-recodonised and/or 
recodontsed. As an example the sequence could be optimised for expression in different 
yeast systems, in vitro systems using human cells, and/or insect cell systems. In such 

10 systems it would be of advantage to purify the protein before using it therapeutically 

and/or as a vaccine. In another example, the sequences could be optimised for expression 
in plant derived systems. Such whole transgenic plants might be ingested to activate the 
immune system against parasites causing severe malaria, or the proteins could be purified 
from such transgenic plants. Plant expression systems could be, but are not limited to 

15 transgenic potatoes, Soya been, tobacco, banana, and/or crops used for animal feeding 
that can be made transgenic with Icnown methods. SEQ ID NO.: 1, SEQ ID NO.: 3, and/or 
SEQ ID NO.: 5 or homologues hereof can be delivered to plants by different means. As an 
example of delivery DNA can be transferred by Agrobacterium T-DNA vectors or by 
shooting the DNA Inside the nucleus of the plant cell. Transient expression can be obtained 

20 with different virus vectors transfecting the plant cell. 

The term 'nucleic acid molecule* refers to an oligomer or polymer of ribonucleic acid (RNA) 
or deoxyribonucleic acid (DNA) or mimetics thereof. This term Includes molecules 
composed of naturally-occurring nudeobases, sugars and covalent internucleoside 

25 (backbone) linkages as well as molecules having non-naturally occurring nucleobases, 
sugars and covalent internucleoside (backbone) linkages which function similarly or 
combinations thereof. Such modified or substituted nucleic acids are often preferred over 
native forms because of desirable properties such as, for example, enhanced cellular 
uptake, enhanced affinity for nucleic acid target and increased stability in the presence of 

30 nucleases and other enzymes, and are in the present context described by the terms 
"nucleic acid analogues" or "nucleic acid mimics". Preferred examples of nucleic acid 
mimetics are peptide nucleic acid (PNA-)/ Locked Nucleic Acid (LNA-) , xylo-LNA-, 
phosphorothioate-, 2'-methoxy-, 2'-methoxyethoxy-, morpholino- and phosphoramidate- 
containing molecules or the like. 

35 

By 'real time quantitative PGR' is meant a method including a fluorescent DNA intercalating 
dye In a PGR reaction mix. This method measures incorporated fluorescens at the end of 
each cycle making it possible to calculate the copy number of mRNA molecules in the 
40 original starting sample. 

By "Enzyme-linked-immunosorbent assay (EUSA)" is meant an assay for determining the 
amount, level or titre of protein, antigen or antibody in a given sample by means of an 
enzyme-catalysed colour reaction. One variant of ELISA is the two-antibody "sandwich" 
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ELISA. This assay is used to determine the antigen concentration in unknown samples. The 
assay Is done by coating a microtiter plate with antibody, antigen is then added and 
allowed to complex with the bound antibody. Unbound products are removed by washing, 
and a labelled secondary antibody (the "detection" antibody) is allowed to bind to the 
5 antigen, thus completing the "sandwich". The assay is then quantitated by measuring the 
amount of labelled secondary antibody bound to the antigen, through the use of a 
colorlmetric substrate. In a second variant of the ELISA method, plates are coated with 
antigen and specific antibodies are used for the detection by incubating the plate with a 
biological fluid. Unbound antibodies are then removed by washing, and a labelled 
10 secondary antibody (the "detection" antibody) is allowed to bind to the primary antibody. 
The level of binding between antigen and antibody is then quantitated by measuring the 
amount of labelled secondary antibody bound to the matrix, through the use of a 
colorlmetric substrate 

15 By "radioimmunoassay (RIA)" is meant a method that utilises radiolabelled Antibody (Ab) 
or Antigen (Ag) to detect Ag:Ab reactions. The Abs or Ags are labelled with '^^I (iodine- 
125) and the presence of Ag:Ab complexes are detected using a gamma counter. RIA can 
be done in solution as well on filters. In solution the Ag:Ab complexes are precipitated and 
the amount of radioactivity in the supernatant measured. 

20 

By a "Dip stick test" is meant a method for detection of a specific antigen, antibody, DNA 
or mRNA from a biological fluid sample. A nucleic acid, antigen or antibody is bound to the 
membrane of the dip stick and contact to a labelled or unlabelled bodily fluid is allowed for 
a given time. The nucleic acid, antigen or antibody bound on the membrane can then be 
25 hybridised to nucleic acid, antigen or antibody labelled with a dye. 

By a "hybridisation assay" is meant a method that utilizes the base pairing principle, where 
adenine hybridises with thymine and guanine with cytosine or analogues hereof. Biological 
(e.g. tissue, blood, or serum) samples can be tested for the presence of RNA or DNA by 
30 hybridisation with a probe, labelled or unlabelled, solid phase or liquid phase. 

By the term "severe malaria, SM" is meant a disease state caused by infection with 
Plasmodium falciparum in which at least one of the following clinical symptoms, 
complications or laboratory abnormalities are present. Clinical symptoms/complications: 
35 Prostration, impaired consciousness, respiratory distress, acidotic breathing, muJtiple 
convulsions, circulatory collapse, pulmonary oedema, abnormal bleeding, jaundice, 
haemoglobinuria. Laboratory findings: B-haemoglobin <5g/dL; B-glucose <2.2mmol/L; P- 
bicarbonate <15mmol/L; P-lactate >5mmol/L; P-creatlnine >265|jmol/L; parasitaemia 
>4%. 

40 

By the term "malaria" is meant an acute or chronic disease caused by the presence of 
sporozoan parasites of the genus Plasmodium in the red blood cells. 
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By the term ^'semi-immune" is meant an an individual who have been exposed to 
Plasmodium falciparum infection and developed an Immune response to the parasite. The 
immune response is not necessarly fully protective. Thus, a semi-rmmune subject is 
protected against severe malaria, but might still be infected with malaria parasites or in 
5 risk of becoming infected with parasites causing non-severe malaria. 

By "m vitro diagnosis" is meant detection of P falciparum derived compounds related to 
SEQ ID NO.: 1, SEQ ID NO.: 2, SEQ ID NO.: 3, SEQ ID NO,: 4, SEQ ID NO.: 5, and/or SEQ 
ID NO.: 6 in a biological fluid or sample. These SEQ ID NO.: 1, SEQ ID NO.: 3, and/or SEQ 
10 ID NO.: 5 related compounds can for example be mRNA, DNA, or for SEQ ID NO.: 2, SEQ 
ID NO.: 4, and/or SEQ ID NO.: 6 protein-antigen, peptide-antigen or antibody being of any 
subclass. The methods for in vitro diagnosis of severe malaria could be, but are not limited 
to PCR, RT-PCR, real-time quantitative PGR, ELISA, RIA, Dip stick test or any Hybridisation 
assay. 

15 

By "VSAsm" is meant a limited and conserved set of VSA that are both stronger and more 
commonly recognised by IgG in the plasma of malaria-exposed young semi-immune 
Individuals than VSA {VSAu„) expressed by parasites causing uncomplicated malaria (UM) 
in older semi-immune children. To test for VSAsm expression on the surface of erythrocytes 

20 infected by a P. falciparum /solate/Nne/clone, the level of specific recognition of VSA 
expressed by the isolate/line/clone m question is tested again a panel of plasma by flow 
cytometry. (Staalsoe et al. 1999, Staalsoe et al. 2003). Plasma is collected from children 
(aged 1-10 years) living in an area of very high malaria transmission intensity. The 
children are classified as responders or non-responders according to the reactivity of their 

25 plasma in comparison with that of plasma collected from individuals living in areas where 
malaria transmission never occurs. The percentage of responders in age groups defined by 
year Is plotted against age. The tested parasite isolate/iine/clone is classified as a VSAsm i*^ 
the percentage of responders is higher than 80% in 2-year-old children. A parasite 
isolate/line/clone is classified as VSAum if the percentage of responders aged 2 years is 

30 lower than 50% and increases steadily by age. 

By "VSAum" is meant less well conserved set of VSA that are less well and less commonly 
recognised by IgG in the plasma of malaria-exposed young semi-immune individuals, but 
well and more commonly recognised by immune adults from malaria endemic areas. To 

35 test for VSAum expression on the surface of erythrocytes infected by a P. falciparum 
/solate/line/clone, the level of specific recognition of VSA expressed by the 
isolate/line/clone in question is tested again a panel of plasma by flow cytometry. 
(Staalsoe et al^ 1999, Staalsoe et al. 2003). Plasma is collected from children (aged 1-10 
years) living in an area of very high malaria transmission intensity. The children are 

40 classified as responders or non-responders according to the reactivity of their plasma In 
comparison with that of plasma collected from individuals living in areas where malaria 
transmission never occurs. The percentage of responders in age groups defined by year is 
plotted against age. The tested parasite isolate/line/clone is classified as a VSAsm if the 
percentage of responders is higher than 80% in 2-year-old children. A parasite 
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isolate/line/clone is classified as VSAum »f the percentage of responders aged 2 years is 
lower than 50% and increases steadily by age. 

With respect to the present invention the term 'polypeptide' refers to an amino acid chain 
5 of any length, mcluding a full-length protein, oligopeptides, short peptides and fragments 
thereof, wherein the amino acid residues are linked by covalent bonds. 

"Isolated" and "purified": The term "isolated" requires the material to be removed from the 
environment in which it was present originally. For example, a polypeptide or nucleic acid, 

10 which is expressed in a cell, is not isolated. However, the same polypeptide or nucleic add, 
when separated from some or all of the coexisting material occurring in the original 
environment, will be considered as isolated. It is in accordance with this definition to 
regard polypeptides and nucleic acids present in cell lysates as isolated. By "purifying"' a 
compound such as a polypeptide or a nucleic acid Is meant increasing the degree of purity 

15 of a preparation of the compound by removing completely or partially at least one 

contaminant from the preparation. When applied to a preparation of a compound the term 
"degree of purity*' refers to its relative content by weight of the compound of interest, 
based on the total weight of the preparation- Tlie degree of purity of a compound may be 
within the range of 1 - 100%, such as from 1 - 100%, 10 - 100%, 20 - 100%, 30 - 100%, 

20 40 - 100%, 50 - 100%, 300 - 100%, 70 - 100%, 80 - 100% and 90 - 100%. 'Substantially 
pure' is herein used to describe a polypeptide or a nucleic aad with a degree of purity of at 
least 70%, such as at least 75%, at least 80%, at least 85%, at least 90% at least 95%, 
at feast 99% or preferably substantially pure from other components. The % value herein 
Indicates % (w/w). 

25 

Figure Legends 
Figure 1 

Quantitative fluorometric measurements (mi -5) of plasma Ab recognition of VSA 
expressed by P. falciparum isolates. Parasites (columns) were obtained from 68 pediatric 

30 patients from Ghana (parasite donors). Plasma samples (rows) were obtained from 96 
healthy children from the same area as the patients. Small squares represent specific 
parasite/plasma combinations. For each such combination, Ab levels are Indicated by the 
shading of the square (ml'4). The healthy plasma donors (rows) are sorted by age, and 
within each of the two clinical categories the parasite isolates (columns) are sorted 

35 according to the age of the parasite donors (malaria patients). Within parasite donor age 
groups, individual isolates are sorted according to level of VSA IgG recognition (sum of 
scores). Small numbers along the right and bottom edges are for enumeration of plasma 
samples and parasites, respectively. 

40 Figure 2 

Age dependency of Ab recognition of VSA expressed by 68 P. falciparum isolates. For each 
parasite/plasma combination, the Ab recognition was scored on a six-level scale, according 
to Ab recognition of the isolate by 2'fold dilutions of a pool of plasma from adult, parasite- 
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exposed Ghanaians. The overall Ab recognition of individual isolate was subsequently 
calculated as the sum of scores obtained with each of the 96 plasma samples. The 
dependency upon the age of the parasite donors (malaria patients) Is shown in A and S, 
whereas the dependency upon the age of the healthy plasma donors is shown in C and D. 
5 Parasite isolates obtained from patients with severe P. falciparum malaria are shown in A 
and C, whereas parasites from patients with nonsevere malaria are shown In B and £>. In 
all panels, means and 95% confidence intervals are indicated. 

Figure 3 

10 Distribution of patient age {A) and VSA Ab fluorescence sum of scores (8) according to 
cluster assignment of the infecting P. falciparum isolate. Median (center line), 25th and 
75th percentiles (box), 10th and 90th percentiles (vertical lines), and outliers (•) are 
shown for each cluster. 

15 Figure 4 

Plasma Ab recognition of VSA expressed by parasite isolates obtained from 68 Ghanaian 
children with P, falciparum malaria. The level of fluorescence (mean and 95% confidence 
intervals) obtained with 2-fold dilutions of a plasma pool from parasite-exposed adult 
Ghanaians is shown, including the regression line (solid line) and its 95% confidence 
20 Interval (dashed lines). A, Ab recognition of isolates according to their origin from patients 
with severe (•) or nonsevere (O) P. falciparum malaria. S, Ab recognition of isolates 
according to their origin from young (•; 3-4 years of age) or older (O; 5-11 years of age) 
R falciparum malaria patients. 

25 Figure 5 

Levels of IgG with specificity for variant surface antigens in P. falciparum 3D7 before 
(panels A-B) and after (C-D) In vitro selection by iRBC reactivity with IgG in the plasma of 
semi-immune Ghanaian children. Open histograms show VSA-specific IgG reactivity in 
plasma from semi-immune Ghanaian children (A, C) or clinically immune Ghanaian adults 
30 (B, D). Shaded histograms show corresponding reactivity in plasma from donors without P. 
falciparum exposure (negative control). 

Figure 6 

An In vitro culture of ethidium bromide-labelled R falciparum 3D7-infected RBC following 
35 sequential exposure to the SMI plasma pool, biotinylated secondary anti-human IgG, and 
streptavidin-coated DynaBeads (m5-I, m3-2). Uninfected RBC (white left-arrows), and 
iRBC coated (white down-arrows) or not coated (white up-arrows) with DynaBeads, as well 
as free DynaBeads (black arrows) can be seen in the micrograph, 

40 Figure 7 

Plasma levels of IgG with specificity for variant surface antigens expressed by 3D7, 3D7smi, 
and five P. falciparum field isolates from Sudan. Plasma was obtained from Sudanese (A) 
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and Tanzanian (B) individuals, and levels are given on a semi-quantitative scale 
determined by recognition of intact iRBC by IgG In plasma pools from non-exposed donors 
(negative control pool) and highly exposed donors (positive control pool), respectively. IgG 
reactivity in plasma from the five parasite donors to the autologous R falciparum isolates 
5 are indicated by heavy lines. 

Rgure 8 

Adhesion of 3D7- and 3D7s„i-infected RBC to wild-type (CHO-0) and CD36-transfected 
(CHO-CD36) Chinese hamster ovary (CHO) ceils (m4-i). 

10 

Figure 9 

A) Schematic presentation of all 3D7 yar gene sequence analyses (m2-I, m2-2, m2-3). 
Gene names, chromosomal location, transcriptional direction and domain structure are 
shown along with the duster to which each gene was assigned by the sequences analyses, 

15 Sequences that could not be assigned to any cluster were named X. Three major var gene 
groups (group A-C), two intermediate groups group B/A and group B/C and two unique 
genes representing varl and var2 var gene families were defined (framed). B) Sequence 
analyses of var genes from other P. fafciparum strains than 3D7, Protein accession 
numbers, originating strain, domain structure and the closest related 3D7 var 5' sequence 

20 are shown along with sequence group allocations as defined in 3D7. *) The genes were 
assigned to group A, as their DBLla sequences clustered together with other group A 
sequences in analysis of DBLa sequences. **) Pseudogene, belongs to the varl family ^) 
Upstream sequences with atyplcally low similarity to upsB or upsC sequences. 

25 Figure 10 

Schematic representation of head-to-head genomic organisation of nf and upsA flanl<ed 
v^ar genes. Nine genes are flanked by a nf gene, which has its Initiation codon 
approximately 3 or 4 kb upstream from the var initiation codon, and one var gene by 
another var gene at -2 kb. Punctured lines represent upsA, dotted lines upsA-r/f and full 
30 line upsBsh. The diamond marks the putative termination site of upsA characterised by a 
stretch of TA repeats. Sizes of genes are not in scale- 
Figure 11 

Distance tree of 3D7 var gene 500 bp 3^ region generated using the p-dlstance/NJ method 
35 (m2-3;. The four dense clusters A through D were supported by both bootstrapping and 
maximum parsimony (MP) tree (not shown). The relationship of the remaining sequences 
could not be verified by the MP tree making method. Numbers at the nodes represent 
bootstrap proportions (BP) on 1000 replicates. The scale bar represents the proportion of 
different nucleotide compared. PlasmoDB accession numbers are shown. Genes with 
40 assigned cluster are collected in Figure 9. 
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Figure 12 

Distance tree of DBLaCIDRl domarns of 3D7 PfEMPl and pseudogene PFE1640w generated 
using the p-distance/NJ method {m2-3). The clusters A through E were supported by both 
bootstrapping and maximum parsimony tree (not shown). Numbers at the nodes represent 
5 bootstrap proportions (BP) on 1000 replicates. The scate bar represents the proportion of 
different amino adds compared. PlasmoDB accession numbers are shown. 

Figure 13 

fo\6 changes in var gene transcription by late-stage P. falciparum 3D7 before and after 
antibody-selection of parasites for selection of VSAsw-type antigens using DynaBeads 

10 coated with plasma IgG from three different pools of plasma from semi-immune African 
children (SMI, SM2, Sl^3) (mJ-2). Transcription levels were measured using real-time PCR 
and primers specific for 59 v^ar genes and one pseudogene (SatantI et aK, 2003) (m5-2). 
Panel A shows mean fold changes (± SD; 5 experiments) in var gene transcription related 
to selection using the SMI pool. Panel B shows fold change values in one experiment using 

15 the SM2 pool, while Panel C summarizes one experiment using the SM3 pool, Blacl< bars 
indicate the 15 most highly transcribed v^ar genes in selected and/or unselected 3D7. A 
3-fold change in var gene transcription (dashed lines) was arbitrarily defined as the cut-off 
for biologically significant changes in i^argene transcription. Primers see Table 1. Grouping 
of var genes is as described in (Lavstsen et a!., 2003). 

20 

Figure 14 

Changes in var gene transcription in synchronised ring-stage (30 hr) Plasmodium 
falciparum 3D7 in vitro selected with a pool of plasma antibodies from children from semi- 
immune African children (SMI, SM2) (se legend to Fig 13). Panel A shows mean fold 
25 changes (± SD; 5 experiments) In var gene transcription related to selection using the 
SMI pool. Panel B shows fold change values in one experiment using the SM2 pool. A fold 
change of 2 (dotted lines) was defined as the cut-off for biological interesting changes in 
var gene transcription. 

30 Figure 15 

Relative levels of v^ar transcripts in 10 pg of total RNA obtained from 3D7 (lane 1) and 
antibody-selected 3D7 (lane 2) ring stage iRBC. Northern blots (m7-i) probed with DIG 
labelled RNA probes targeting (Panel A) PFD1235w/MAL7PlA (SEQ ID NO.: 1). Transcript 
size is 12.5kb. {Panel B) Ethidium bromide-stained gel shown to allow comparison of total 
35 RNA amounts. 



Figure 16 

Expression of PfEMPl in 3D7 and antibody-selected 3D7trophozoite/schizont stage iRBC. 
40 Western blots {mlO-5, mlO-6) of SDS-extracts obtained before (Lane A) and following 
(Lane B) antibody selection. Samples were run in 5% SDS-PAGE gels, blotted and probed 
with antibodies to a relatively conserved sequence of the intracellular acidic segment 
ATS/Exon2 (lanes A-B) and a specific antibody raised against DBL5-6 of VAR4 (SEQ ID 
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NO.: 2)(Lanes C-D). The antibody-selected parasite expressed the product of 
PFD1235W/MAL7P1.1 (***)(SEQ ID NO.: 2) and a PfEMPl with molecular weight (MW) 
corresponding to the predicted MW of MAL6P1.316 {**). Both selected and unselected 
parasites expressed a PfEMPl of a MW corresponding to the predicted MW of PF08_0107 
5 (*), SDS-extracts of RBC infected by NF54 obtained from a Dutch volunteer (Hermsen et 
al., 2001). Extracts were obtained on day 8, 9, and 10 (Lanes E-G) and the blot probed 
with the DBL5-5 antibody used in Lanes C-D. 

Figure 17 

10 Surface expression of antigens in R falciparum 3D7 before (Al-5) and after (Bl-5) 
antibody selection {m3-2, mlO'6). Murine plasma antibodies against recombinant 
DBL5-6of VAR4 (SEQ ID NO.: 2) reacted with the surface of most antibody -selected 3D7 
IRBC in FACS (Bl; green line) and fluorescence microscopy (B2; green dots). The 
antibodies only reacted with a minority of unselected 3D7 (Al, A2), and with none of the 

15 trypsin-treated iRBC from antibody-selected 3D7 (Bl; purple ilne). Shaded histograms 

show flow cytometry reactivity In pre-vacclnation mouse plasma. VSA-specifIc IgG (A3, B3) 
reactivity in plasma from a semi-immune African child (red line) and a clinically Immune 
African adult (blue Mne) confirming the VSAum phenotype of unselected 3D7 and the VSAs„ 
phenotype of antibody-selected 3D7. Localization of the VAR4 (SEQ ID NO.: 2) using 

20 confocal microscopy and murine plasma anti-DBL5-5 antibodies {A4, A5, B4, B5). 

Ethldlum-bromide staining of DNA in the nuclei is red/orange and staining of VAR4 (SEQ ID 
NO.: 2) using FITC-labeled antibodies is green. Pre-vaccination mouse plasma did not stain 
iRBC (data not shown). 

25 Figure 18 

Plasma antibody levels to recombinant DBL5-6 (A), CIDRl-a (B), DBL3-p (C), and NTS (D) 
domains of the VAR4 (SEQ ID NO.: 2) in Tanzanian children and adults, and in Danish 
donors without P. falciparum exposure (DK) {ml2-l). For competition ELISA experiments 

30 {mll-Z), plates were coated with recombinant CIDRl-a domains of the proteins encoded 
by PFD1235W/MAL7P1.1 (SEQ ID NO.: 1 and SEQ ID NO.: 2) (E) and PFE1644w/v3rl (F), 
and ELISA reactivity measured in three different plasma samples, which had been pre- 
{ncubated with Increasing concentrations of homologous or heterologous fusion protein as 
indicated in the top part of each panel. Non-depieted (ND) plasma was included for 

35 comparison. 

Figure 19 

Similarity of the PFD1235W/MAL7P1.1 encoded protein (SEQ ID NO.: 2) homologues in 
genotypically distinct parasite isolates. Alignment of DBLl-ct/CIDRl-a from two peripheral 
40 blood parasites from children (BM021 and BM48) and the protein product of 

PFD1235W/MAL7P1.1 (SEQ ID NO.: 2) from 3D7 covering amino acids 842-1602 in 3D7 
SEQ ID NO. 2. Identical residues appear on black background, conserved amino acid 
changes on grey, and radical changes on white background. 
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Figure 20 

Fold changes In var gene transcription in ring-stage (3D hr) (Panel A) and by late-stage 
(Panel B) P. falciparum 3D7 before and after in vitro selection on human endothelial cells 
5 for selection of parasites expressing VSAs^-type antigens {m6-l, m6-2). Transcription 
levels were measured using real-time PCR and primers specific for 59 var genes and one 
pseudogene (Salanti et aL, 2003). The two panels shows fold change values in one 
experiment. Black bars indicate the 15 most highly transcribed var genes in selected 
and/or unselected 3D7. A 3-fold change in var gene transcription (dashed lines) was 
10 arbitrarily defined as the cut-off for biologically significant changes in var gene 

transcription. Primers see Table 1. Grouping of var genes is as described in (Lavstsen et 
ah, 2003). 

Examples 
15 Example 1: 

Erythrocytes infected by P. falciparum parasites causing severe maiaria (SM) are stronger 
and more commoniy recognised by IgG in plasma of malaria-exposed individuals tlian 
erythrocytes infected by other P. falciparum parasites 

20 P. falciparum-\nfected red blood cells (IRBC) adhere to endothelial host receptors through 
parasite-encoded, clonally variant surface antigens (VSA). The VSA-mediated iRBC 
adhesion and the acquired VSA-specific antibody response is linked to disease severity. 
Parasites isolated from young children with severe malaria (SM) express a limited and 
conserved set of VSA (VSAsm) that are both stronger and more commonly recognised by 

25 IgG in the plasma of malaria-exposed individuals than VSA (VSAum) expressed by parasites 
causing uncomplicated malaria (UM) in older semi-immune children. It is therefore likely 
that the SM-specific protective immunity acquired in young children in areas of intense 
parasite transmission is based on antibodies to VSAsm that inhibits the adhesion to 
endothelial cells of parasites expressing VSAsm on the surface of infected erythrocytes. 

30 

Materials and methods 

ml'l. Isolation of IRBC from maiaria patients: Circulating human erythrocytes infected 
with 

35 Plasmodium falciparum (iRBC) were collected in vacutainers containing either 

heparin or citrate-phosphate-dextrose (CPD) as anticoagulant. Plasma and white 
blood cells were removed upon centrifugation at SOOxg, and the erythrocyte pellet 
resuspended In an equal volume of freezing solution (28%(v/v) glycerol in 
4.2%(w/v) sorbitol and 0.9%(w/v) NaCI in HI2O) and snap-frozen in liquid Nitrogen. 

40 In vitro culture of P. falciparum parasites: Cryopreserved iRBC were restored by 

thawing at SV'C followed by washing in 3.5% NaCI^ and washing twice m 
RPMI 1640 culture medium ( h ttp : //www . I ifetech .com ^ . Parasites were maintained 
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In a 5% suspension culture of uninfected human O"*^ erythrocytes in RPMI 1640, 
supplemented with Albumax, hypoxanthin, glutamine, gentamycin (all 
http : //www ■ I if e tech . com ) . and non-immune human serum. Culture medium was 
changed and Giemsa-stained smears were prepared for microscopy on a dally 
5 basis. 

Purification of IRBC from cultures: IRBC with haemozoin-containing trophozoites and 
schizonts were purified from in vitro cultures (ml-l) by magnet-activated cell 
sorting (MACS; http://www.miltenyiblotec.com ). exploiting the magnetic properties 
of haemozoin. In short, iRBC were passed through a size-C MACS column mounted 

10 with a 0.9 mm x 40 mm needle. The column was washed with phosphate buffered 

saline (PBS) supplemented with 2% foetal calf serum (FCS; PBS-S) until no 
erythrocytes couid be seen in the eluate. The column was removed from the 
magnet, and the trophozoite- and schizont-containing iRBC retained in the column 
were then released by further washing. A purity of trophozoite-/schizont-infected 

15 iRBC >90% was usually reached by this procedure . 

Detection of human VSA-specific IgG: Purified IRBC (mJ-2) were labelled with 1 ^1 
ethidium bromide (EB; httD://www.siQma-aldrich.com > solution (0,1 mg/ml) per 
10^ erythrocytes to allow discrimination between nucleic acid-containing IRBC and 
uninfected erythrocytes devoid of DNA/RNA. For each sample, 2 x 10^ erythrocytes 

20 in 100 ^1 PB5-S (ml-3) were used, EB-labelled iRBC were mixed with 1-5 i.il of 

human plasma or antibody preparation, followed by goat anti-human IgG 
( http://www.dako.com^ . diluted 1:200 and by fluorescein isothiocyanate (FITC)- 
conjugated rabbit anti-goat Ig ( http : // www .dako. com ) . diluted 1:25. The 
antibodies were diluted in PBS-S, and 100 j-il of the dilution was added per sample. 

25 At each step, samples were incubated for 30 min at 5°C. The samples were washed 

twice in 3 ml PBS-S between each incubation step and once after the last. Samples 
were kept overnight at 5®C before analysis on a Coulter EPICS XL-MCL flow 
cytometer f htto : //beckman. com ). 
ml-4. For quantification of FITC fluorescence, the mean fluorescence intensity (MFI) of the 

30 ethidium bromide positive red blood cells was calculated using WinList software 

f http://www>vsh.com ^. Plasma from Danish donors never exposed to falciparum 
malaria did not label uninfected erythrocytes or iRBC above the level of the 
secondary and tertiary antibodies alone. In contrast, the plasma pool prepared 
from hyper-immune Ghanaians selectively labelled iRBC but not uninfected 

35 erythrocytes. 

mi -5. Human plasma samples tested: The individual human plasma samples were 
obtained from the following groups of individuals: 

Plasma from Danish adults without exposure to P. falciparum parasites were 
obtained at the Copenhagen University Hospital (Rigshospitalet) from laboratory 
40 staff and blood donors being screened for the presence of anti-RhD antibodies. 

Plasma from 96 Ghanaian children living In Dodowa Town, 50 km northeast of 
Accra. The area is characterized by hyperendemfc, seasonal transmission of P. 
falciparum parasites. All children were healthy at the time of blood sampling. 
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A pool of plasma from healthy, parasite-exposed adults from the village of Gomoa 
Onyadze, 80 km west of Accra (Nielsen et al. 2002). 

To corroborate the above hypothesis levels of antibodies in plasma from 96 healthy 
5 children, aged 3-8 years, with specificity for each of the 68 parasite isolates (Fig. 1) were 
measured {ml-4). Overall, antibody (Ab) recognition of the parasite VSA differed widely 
among plasma donors. While plasma samples from some children contained barely 
detectable levels of Abs specific for VSA expressed by any of the isolates (e.g., plasma 
donors 15, 34, and 59; see Fig. 1), others had high levels and a broad range of VSA- 

10 specific Ab (e.g., plasma donors 14, 41, and 84; Ftg. 1). By analyzing the parasite-specific 
sum of scores from the 96 x 68 recognition matrix, we found that recognrtion of VSA was 
independently associated with both the age of the malaria patient (3-^4,5-30, and 7-11 
years; p = 0.005) and the clinical picture (severe or nonsevere; p = 0.006, by two-factor 
ANOVA). There was no significant interaction between these two sources of variation (p = 

15 0.26). Pair wise multiple comparison procedures {Tukey's post-hoc test) showed that white 
the VSA sum of scores of parasites from the youngest patient group was significantly 
different from that from either of the two other age groups (p < 0.05), the latter two were 
not significantly different from each other (p >0.05). The relationship between severity and 
age of the parasite donor Is illustrated in Fig. 2, A and 6. These results show that Ab 

20 recognition of parasite VSA was independently affected by both the age and the clinical 
severity of the malaria patient from whom the parasite was obtained. This Is important, 
because disease severity Is inversely correlated with age in areas of endemic parasite 
transmission, which in all likelihood reflects age-dependent acquisition of protective 
immunity (reviewed in Riley et al. 1994). Protective immunity appears to involve 

25 acquisition of Ab responses to a broad range of VSA (Bull et al. 1998), and consistent with 
this observation we found that VSA Ab levels correlated with the age of the healthy plasma 
donors (Fig.2, C and D). The authenticity of our finding of independent effects of patient 
age and disease seventy is supported by the fact that the age distributions of children with 
severe and nonsevere disease were similar (by t test, p = 0.5) in the present study (5.30 ± 

30 0.4 and 30.0 ± 0.5 years, respectively; mean ± SD), due to the exclusion of children <3 
years of age. 

To further substantiate our findings and to investigate whether parasites from patients with 
severe P. falciparum malaria expressed particular VSA, we used the 96 x 68 recognition 

35 matrix (Fig. 1) to search for patterns of similarity in the VSA Ab recognition of the parasite 
isolates. Hierarchical duster analysis Identified three main clusters (data not shown). In 
one of these (cluster I), all but one (90%) of the isolates were from severe cases, whereas 
this was the case for only 5 of 20 (25%) in cluster II. The third and largest cluster (III) 
showed an intermediate pattern, with 21 of 38 (55%) isolates from patients with severe 

40 malaria (Fig. 3). The proportion of isolates from severe patients in the three clusters was 
thus quite different (by test, p ^ 0.009). When we analyzed the age distribution of the 
patients donating the parasites within the clusters, cluster I was composed of parasites 
from young patients, whereas the other two clusters contained parasites from older 
patients (Fig. 3A). The duster-specific differences in patient age composition did not quite 
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reach conventional statistical significance (by Kruskal-Walils test, p = 0.06). The 
distribution of the parasite-specific sum of scores among the clusters showed that duster 1 
was composed entirely of parasites expressing very welhrecognized VSA, whereas the 
opposite was true for duster II. Again, duster III formed an Intermediate group (Fig. 3B). 
5 The distribution of sum of scores was significantly different among the three clusters {by 
Kruskal-Wallis test,p < 0.001), with all pairwise differences being significant (by Dunn's 
post-hoc test, p < 0.01 in all cases). 

We have used flow cytometry to measure VSA-specificIgG to provide evidence of 
10 modulation of VSA expression by acquired immunity. Our method is particularly suited to 
this type of analysis, as It allows unbiased and quantitative analysis of large matrixes of 
VSA and corresponding Abs of spedfied Isotype. We found that the level of plasma IgG 
recognition of VSA expressed by P. falciparum isolates obtained from patients with severe 
malaria was approximately twice that of VSA from nonsevere isolates (Fig. ArA). In a similar 
15 way we found that VSA Ab recognition of isolates from young patients (3-4 years of age) 
was -2-fold that of isolates from older patients (5-11 years of age; Fig. 45). 

Thus, the VSA antibody recognition of parasites from severe patients is broader and more 
intense than recognition of VSA expressed by parasites from other malaria patients (Figs, 1 
20 and 2), independently of the age of the patient. Thus, our data suggest that acquisition of 
VSA-specificAb responses gradually restricts the repertoire of VSA that are compatible with 
parasite survival in the semi-immune host. Furthermore, it appears to limit the risk of 
severe disease by preventing the expression of VSA likely to cause life-threatening 
complications, such as cerebral malaria and severe anaemia. 

25 

Example 2: 

Sub-grouping of Plasmodium faldparum 3D7 var genes based on sequence analysis of 
coding and non -coding regions 

30 PfEMPl is a polymorphic family of high molecular weight adhesion antigens expressed on 
the surface of infected erythrocytes. PfEMPl is an Important target for protective immunity 
and is implicated in the pathology of malaria through its ability to adhere to host 
endothelial receptors. The accumulation of antibodies against a broad repertoire of 
PfEMPls is probably the functional basis for the natural acquisition of immunity to malaria 

35 (Bull et al, 1998) 

All var genes are characterised by a two-exon structure. Exon 1 encodes a large extra- 
er/throcytic and highly variable region containing two to seven Duffy-binding like (DBL) 
domains and mostly one or two cysteine-rich inter-domain region (CIDR) domains. Based 
on sequence homologies, the DBL domains can be sub-divided into a, p,y,6, and e types 
40 and the CIDR domains into CIDRa other (CIDR-O) types. A subset of var genes 

furthermore contains a second cysteine-rich domain called C2. Exon 2 encodes the intra- 
erythrocytic (cytoplasmic) and conserved part of the protein. 
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The entire genome of the P. fBicfparum clone 3D7 genome is now known, including its 
complete var gene repertoire. Figure 9 shows the domain structure of each of the 59 var 
genes as well as the truncated pseudo-gene PFB1640yst, 

S Methods 

m2'l. Nucleotide and deduced amino acid sequences as well as location and 

transcriptional directions of 3D7 var and r/f genes were obtained from the 
Plasmodium Genome Resource - http://www.Diasmodb.orQ . 

10 m2-2. Alignments were performed using the QustalW multiple alignment method, 

European Molecular Biology Laboratory, Heidelberg, Germany at default parameters 
{Gap Open: 10.00; Gap Extention: 0,20, Gap Separation Distance: 4, Protein 
weight matrix: Gonnet, DNA weight matrix: identity (lUB)). Alignments were 
corrected by hand using Bioedit (Hall, 1999) to assure homologous sequences for 

15 sequence analysis and tree-building. For distance tree-building, the var gene 5' 

flanking regions were defined as the 600 bp, 1.4 kb or 2.0 kb upstream of the 
translation initiation codon and the 3' flanking region as the 500 bp downstream of 
the translation stop codon. The available var flanking sequences from other strains 
than 3D7 varied between 250 and 2100 bp in length. For the most part 3' 

20 sequences were those retrieved by Mercereau-Puijalon et al (Mercereau-Puijalon et 

al. 2002} , l/ar gene domain structures were defined using definitions described in 
Smith et aL (Smith et al. 2000) . DBLoCIDRl domains were aligned from Pro-Cys 
(PC) of DBLo homology block A to the conserved Glu-Trp (EW) motif of CIDR M2 
area, resulting in sequence lengths of 550 to 650 aa. For analysis of DBL 

25 relationships sequences covering Pro-X-Arg-Arg (PXRR) of DBL homology block B to 

Glu-Trp (EW) of homology block H were aligned* 
m2-3. Distance trees were constructed the by p-distance/neighbour-joining (NJ) method 
as well as maximum parsimony (MP) using MEGA version 2,1 (Kumar et al. 2001). 
Trees were bootstrapped 1000 times and compared between NJ and MP tree- 

30 building methods to assure confidence in topology. Observed clusters from each 

tree were confirmed visually on alignments. 

The 1,5 kb 5* region of 3D7 var genes has previously been described to group into three 
major sequence groups, upsA, upsB and upsC (Gardner et al. 2002) . To further 

35 investigate sequence similarities in this region, we analysed the 2 kb upstream sequences 
of all 3D7 var genes and the pseudo var gene PFE1640w. In agreement with Gardner et aL 
(Gardner et al. 2002) the alignments revealed three major sequence groups with high 
similarity between sequences of each group. However, two sequences did not align well 
with any of the groups and witliin the groups, subgroups could be identified. Thus, each of 

40 the groups were analysed separately. 

Ten var genes had 5* regions belonging to the upsA group and all but one were positioned 
head-to-head with a nf gene, the exception PF08„0141 was head-to-head with another var 
gene (Fig. 10). Using a primer set targeting upsA 307 sequences around -900 bp from the 
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translation initiation codon, we could PGR annpiify products of the expected sizes in 3D7 
genomic DNA, as well as in five of five field-isolates tested (data not shown). These data 
suggests that upsA regions not are unique to 3D7. 

5 Alignments and tree-building (Fig. 11) of the 500 bp var 3' regions divided most sequences 
into four clusters (A-D), 13 sequences fell outside these dusters, and the relationship 
between these sequences could not be confirmed by bootstrapping or comparison the two 
tree-building methods used. 

10 Because most PfEMPl molecules contain a semi-conserved head structure comprising of 
DBLla and CIDRl, we restricted the analysis of coding sequences to these domains. In 
3D7 all but one var gene encode a DBLla as the first domain and in all but four genes DBLl 
is followed by a CIDRl. Since alignment and tree constructions of DBLl and CIDRl 
domains individually yielded almost identical clusters, we decided to analyse the head 

15 structure sequences from the N-terminal region of DBLl to a conserved motif in the C- 
terminal region of CIDRl (Fig. 12). Fifty-two sequences, including that of pseudogene 
PFEi640w, could be grouped into five clusters^ and four sequences could not be assigned 
any of these. When all CIDR sequences are aligned most CIDRls fall into separate clusters 
of CIDRa or CIDRal domains (Robinson et aL 2003), The exceptions are three sequences 

20 {PF08_0141, PFl 1^0008, PF13_0003), which fall into a CIDRy cluster. In Figure 12, the 
head structures of these genes fall into group A. Robinson et at. (Robinson et al. 2003) 
found that most CIDR domains bind CD36 but identified nine, which did not. These 
constitute cluster A. 

25 Figure 9 schematically sums up the findings of all the var gene sequence analyses. The 
combination of clusters and chromosomal organization of the var genes indicate that var 
genes can be grouped into three major subgroups, var group A, B and C and two 
intermediate groups group B/A and group B/C, which appear to represent transitions 
between these three groups. The two genes previously shown to belong to conserved var 

30 families, varX and var2, fell outside these groups. Group A var genes were most easily 
defined, whereas the borders of the proposed group B and C were less clear (Fig. 9A), The 
grouping was supported by analyses of both coding and non-coding sequences. However, 
the best predictors for the groups were the upstream region and chromosomal 
organization. Thus, genes placed near the telomere and with a transcriptional direction 

35 towards the telomere all had upsA sequences and formed group A. Group B were 

dominated by telomeric located but centromeric transcribed genes flanked by upsB and 
finally group C harboured all centromeric located genes with a upsC SVegion. 

Group A comprise most large PfEMPls with a domain structure different from the most 
40 common 4-domain type, which is the predominant domain structure of Group B and C. 

Two genes PF08_0140 and MAL6P1.316 were classified as B/A because they had upsBsh 5' 
regions and chromosomal characteristics in common with group B genes, but had DBLo- 
CIDRl sequences and domain structure characteristic for group A genes. Interestingly, 
these two genes are adjacent to a group A var or a pseudo var gene with an upsA region 
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both transcribed in the opposite direction, thereby merging their 5'regtons, Adding the 
DBLla-CIDRl of the flanking pseudogene MAL6P1.317 to the alignments placed this 
pseudogene within DBLo-CIDRl cluster A. 

5 The fact that 5* regions predict var gene chromosomal organisation and domain structure, 
and sequence similarities in coding and non-coding regions several thousand bases 
downstream from the translation initiation site implies that recombination, or other 
mechanisms of homogenizing exchange is much more likely to occur between var genes 
within a group than between var genes of different groupings. It can be proposed that an 

10 original ancestral var gene has been duplicated and diverged in the three main types, and 
each of these have then diverged into the genes of each group. In this process information 
may also have been exchanged between genes of different groupings. The data suggests 
that some exchange have talcen place between groups B and C and some characteristics of 
group A have lealced into these groups, but that characteristics from groups B and C have 

15 not gained access to group A. 

In conclusion, var genes can be sub-grouped into three major groups (group A, 3 and C) 
and two intermediate groups B/A and B/C representing transitions between the three 
major groups. The best defined var group, group A, comprises teiomeric genes transcribed 
20 towards the telomere encoding PfEi^Pls with complex domain structures different from the 
4'domain type dominant of groups B and C. A rif subgroup transcribed towards the 
centromere was found neighbouring var genes of group A such that the rif and var 5' 
regions merged. This organization appeared to be unique for the group A var genes, 

25 Example 3; 

Selection of P. falciparum Isolate 3D7 for expression of well recognised VSA In vitro 

Establishment of the genetic control of changes in VSA expression in response to In vitro 
selection is now possible because of the availability of the entire genomic sequence of the 
30 P. falciparum clone 3D7, which is a long-term clone derived from P. falciparum NF54 
isolated from a Dutch malaria patient (Delemarre and Van der Kaay, 1979). 

As a first step towards direct molecular identification of VSAsM-encoding genes in 3D7, we 
established a method to enforce expression of VSAsM-l'ke antigens in this parasite clone by 
35 a novel selection method using plasma from semi-immune children with low levels of 
VSAuM-IgG but high levels of VSAs„-IgG (Fig. 5). 

Materials and methods 

40 m3'l. To enrich 3D7 for iRBC expressing VSA well recognized by IgG in the SMI plasma 

pool, 1 X 10® RBC infected with late trophozoite-stage parasites purified by gelatine 
flotation) were mixed with 200 pL pooled plasma In 3 mL culture medium, and 
incubated the mixture at room temperature (20-25^c) for 15 min. Antibodies not 
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reactive with intact iRBC were removed by washing ^ 2 (800 g, 30 min). The iRBC 
were then resuspended in 3 mL culture medium and incubated as above with 
100 pL biotinylated secondary antibody recognizing human IgG (Dako, Glostrup, 
Denmark). The IRBC were washed twice and resuspended as above and 
5 subsequently mixed with 100 |jL streptavidin-coated DynaBeads (Dynal, Oslo, 

Norway). Finally, we isolated Dyna Beads-coated iRBC by placing the IRBC 
suspension in a magnet field until all DynaBeads had settled at the magnet. The 
culture medium containing RBC and iRBC not covered with DynaBeads was 
removed by decantation. This procedure was repeated once before the DynaBeads- 

10 covered iRBC were transferred to a new culture bottle with medium and uninfected 

RBC for continuation of In vitro culturing. 
m3'2. Human plasma samples used for selection: We used repeated rounds of panning 
on DynaBeads coated by IgG from two plasma pools (SMI, SM2) from semi- 
immune Ghanaian children and one plasma pool (SM3) from semi-immune 

15 Tanzanian children to select 3D7 parasites expressing VSA that were highly 

recognized by IgG in these plasma pools (Staalsoe et al., 2003). 

Flow cytometry analysis of unseiected 3D7 indicated that its VSA expression was 
heterogeneous but that the majority of the iRBC expressed VSA that were poorly 

20 recognised by IgG in the plasma of individuals living in areas of endemic P, falciparum 
transmission (Fig. 5). In particular, only few IRBC were specifically labelled by IgG In the 
plasma of semi-immune children with documented high levels of IgG with specificity for 
VSA expressed by P. falciparum isolated from patients with severe malaria (VSAsm) and low 
levels of VSA expressed by patients with uncomplicated malaria (VSAum) (Fig. 5A). 

25 Although IgG levels in the plasma of sympatric adults were higher than in children, they 
were still moderate (Fig. 5B). Taken together, these data indicate that the VSA expressed 
by unseiected 3D7 were of the VSAum type. 

Only a minority of RBC infected by unseiected 3D7 expressed VSAs„-type antigens. To 
30 increase the proportion of iRBC expressing VSAsM'type molecules in the culture, we first 
incubated 3D7 with the SMI pool of plasma from semi-immune children, then with 
biotinylated anti-human IgG and finally with streptavidln-coated DynaBeads. As expected 
from the heterogeneous IgG recognition of VSA expressed by unseiected 3D7, this resulted 
in some IRBC being covered by multiple DynaBeads while most were devoid of any beads 
35 {Fig. 6). We subsequently isolated the Dyna Beads-covered IRBC by exposing the culture to 
a strong magnetic field, and used only these to propagate the culture further. Flow 
cytometry analysis showed that the IgG recognition of the VSA expressed by the 3D7smi 
sub-clone obtained by three rounds of selection in this manner was much stronger than 
that of VSA expressed by the unseiected 3D7 parental clone (Fig. 5C-D). The biggest 
40 increase in IgG recognition of VSA following selection was seen in the plasma from children 
with documented high levels of VSAsM-type IgG and low IgG levels with specificity for 
VSAuM-type antigens (Fig. 5A and 5C). This, and the fact that the level of IgG recognition 
of 3D7sMi IRBC was both high and similar In children (Fig, 5C) and adults (Fig. 5D) show 
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that the applied selection method had caused a change from predominantly VSAuM-type 
antigens being expressed in unselected 3D7 to VSAsM-type expression in 3D7s„,. 

The 3D7sMi VSAgM-expressing sub-clone was selected by its reactivity to IgG in a plasma 
5 pool from West African children living in an area of hyperendemic, seasonal P. falciparum 
transmission. To investigate whether the high VSA-speciflc IgG reactivity against 3D7smi 
extended beyond this region and in a variety of epidemiological settings, we measured 
levels of IgG with specificity for the VSA expressed by 3D7smi in series of plasma samples 
obtained from Individuals (8 to 42 years of age) living in an area of hypoendemic, highly 

10 seasonal transmission in Sudan {Fig. 7A) and from children aged 3-11 years living In area 
of very Intense transmission in Tanzania (Fig. 7B). Ail 20 Tanzanian children had medium- 
to-high levels of IgG unselected 3D7 iRBC and to all of five local parasite isolates, 
reflecting the high endemicity In their area of residence (Fig. 7B). Nevertheless, levels of 
IgG with specificity for the 3D7 smi VSA were higher than those with specificity for 

15 unselected 3D7 (Fig. 7B). Importantly, levels of 3D7 smi VSA-specific IgG were higher than 
IgG with specificity for VSA expressed by any of the local isolates, although 3D7smi was 
selected on IgG from West Africa. Taken together, these data suggest the existence of 
substantial conservation in time and space of the commonly and highly recognised VSAsm- 
type IRBC surface antigens. 

20 

Jn conclusion, these findings open the possibility of Identifying VSA of Importance In the 
pathogenesis of severe disease (VSAsm) by comparing VSA gene expression In Isogenic 
parasites expressing VSAum' and VSAsM-type IRBC surface molecules that are characterised 
by their differential IgG recognition pattern rather than by their adhesion specificity. 

25 

Example 4: 

3D7 parasite expressing VSAsm does not adhere to CD36 

Most members of the PfEMPl family of 3D7 posses the capacity to bind CD36 and the 
30 binding motives have been mapped to the N-termlnal part of the molecules. PfEMPl that 
bind CD36 are structurally and functionally related (Robinson et al 2003) and belong to 
Groups B, C, or B/C (Lavstsen et al. 2003) and Example 2- In this example we show that 
3D7 expressing a VSAum phenotype bind CD36, whereas 3D7 expressing the VSAsm 
phenotype does not. 

35 

Materials and methods: 

m4'l. We used Chinese hamster ovary (CHO) cells transfected to express human CD36 
and cultured by standard methodology to measure IRBC adhesion to these 
40 receptors (Staalsoe et al. 1999; Hasler et al. 1993; Rogerson et al. 1995). In brief, 

parasites were radiolabeiled by incubating the cultures overnight in the presence of 
^H-phenylalanin (1 MBq for a standard culture containing 200 pi packed RBC)- 
Wlldtype and CD36 transfected CHO cells were grown to a monolayer in 96-wel» 
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microtitre plates (Nunc, Roskilde, Denmark). Late-stage-enriched »RBC (100 \}\, 
1x10^ RBC/ml) were added to the CHO cell monolayer and incubated for one hour 
at 37^C before unbound iRBC were washed away from the CHO cell monolayer. 
Finally, the number of CHO-adhering iRBC was determined by liquid scintillation 
5 spectrometry. 

To Investigate the relationship between VSAsm expression and IRBC adhesion to CD36, we 
compared the ability of RBC infected by 3D7 and 3D7smi to adhere to CD36-transfected 
{CD36-CHO) CHO cells (Fig. 8). While 3D7 showed four- to 10-fold higher adhesion to 
10 CHO-CD36 cells than to wild-type CHO-0 cells, adhesion of 3D7smi to the transfected cells 
was never more than twice that to wild-type cells (Fig. 8). The average adhesion ratio was 
30-3 for 3D7 and 1-4 for 3D7smi/ corresponding to a difference of 4-9 [95% confidence 
interval: (1-9-7-9)]. These results indicate that there Is a relationship between VSAsm 
expression and the loss of ability of iRBC to adhere to CD36. 

15 

In conclusion, these results indicate that parasites expressing the VSAsm phenotype do not 
adhere to CD36, 



20 Example 5: 

Selection of P. falciparum isolate 3D7 for expression of well recognised VSA in vitro results 
in selective up-regulation of Group A var genes 

Parasite-encoded PfEMPl proteins expressed on the surface membrane of iRBC mediate 
25 the adhesion of such erythrocytes to a range of host receptors. The PfEMPl proteins are 
encoded by the var gene family containing 50-60 members per haploid parasite genome. 
Different PfEMPl molecules have different receptor specificities, and clonal switching 
between expression of the various var gene products in a mutually exclusive 
manner allows the parasite to modify its adhesion properties. Gene expression and 
30 switching can be examined using gene-specific primers and real-time PCR. 

Using real-time quantitative PCR we compared var gene expression in the parasite line 
3D7 before (3D7um) and after (3D7sm) selection for antibody recognition (Example 3), 

mS-i. RNA was purified from 3D7um and 3D7sm and used for the synthesis of cDNA. Total 
35 RNA was prepared with Trizol LS ( http: //www . invitrocen .com ) as recommended by 

the manufacturer, and treated with DNAsel (http : //www > i nvitrogen . com ) , Absence 
of DNA in RNA samples was confirmed by stable base fluorescence after 40 cycles 
of real-time PCR with seryhtRNA synthetase primers (Salanti et al,, 2003)) (Table 
1). One HQ of DNA-free RNA was reversed transcribed using Superscript II and 
40 random hexamer primers ( ( http : // www Jn vltrooen ,com >l at 25**C for 10 min, 42**C 

for 50 min followed by 70'*C for 15 min in a total volume of 40 |il. 

Table 1 
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Primer sets used in real-time PCR assays to specifically amplify 59 var genes and one 
pseudogenes (underlined). Where several genes are listed next to a single primer set, 
primer targets in the listed genes were identical- 
Primer Forward primer Reverse primer Target gene(5) 

set 

1 TGCGCTGATAACTCACAACA AGGGGTTCATCGTCATCTTC PFAOOOSw 

4 GACGAGGAGTCGGAAAAGAC TGGACAGGCTTGTTTGAGAG PF10_0001 

5 GTGCACCAAAAGAAGCTCAA ACAAAACTCCTCTGCCCATT PF10_0406 

6 GAGGCTTATGGGAAACCAGA AGGCAGTCTTTGGCATCTTT PFl 1^0007 



7 GACGGCTACCACAGAGACAA CGTCATCATCGTCTTCGTTT PFl 1^0008 (SEQ ID NO.: 

3) 



8 


TGCTGAAGACCAAATT6AGC 


TrGTTGTGGTGGTTGTTGTG 


PF11_0521 


9 


TCGATTATGTGCCGCAGTAT 


TTCCCGTACAATCGTATCCA 


PFL0Q20W 


10 


TGGTGATGGTACTGCTGGAT 


TTTAI J 1 ICGGCAGCATTTG 


PFL0030C 


11 


GACGCCTGCACrCTCAAATA 


TTGGAGAGCACCACCATTrA 


PFL0935C 


12 


AGCAAAATCCGAAGCAGAAT 


CCCACAGATCi 1 \ ICCTCGT 


PFL1950W 


15 


CATCCATTACGCAGGATACG 


AAATAGGGTGGGCGTAACAC 


PFL1960W 


17 


GGCACGAAGTTTTGCAGATA 


TTTGTGCGTC rn C ri CGTC 


PFL2665C 


18 


CGGAGGAGGAAAAACAAGAG 


TGCCGTATTTGAGACCACAT 


PFL0005W 


19 


CGGAATTAGTTGCCTTCACA 


CATTGGCCACCAAGTGTATC 


PF13_0364 


20 


CACAGGTATGGGAAGCAATG 


CCATACAGCCGTGACTGTTC 


PF13 J003 (SEQ ID NO 
5) 


21 


CAAI T n GGGTGTGGAATCA 


CACTGGCCACCAAGTGTATC 


PFB 1055c 


22 


ATGTGCGCTACAAGAAGCTG 


TrGATCTCCCCATTCAGTCA 


PFBOOlOw 


23 


CAATCTGCGGCAATAGAGAC 


CCACTGTTGAGGGG 1 1 nCI" 


PFC1120C/PFC0005W 


25 


ATATGGGAAGGGATGCTCTG 


TGAACCATCGAAGGAATTGA 


PFD0020C 


26 


ACCGCCCCATCTAGTGATAG 


CACI iGGIGATGTGGTGTCA 


PFD06i5c 


27 


TAAAAGACGCCAACAGATGC 


TCATCGTCTTCGTCTTCGTC 


PFD0625C 


28 


ACTTTCTGGTGGGGAATCAG 


TTCACCGCCACTTACTTCAG 


PFD0630c/PFD0635c 


30 


GACGACGATGAAGACGAAGA 


AGATCTCCGCATTTCCAATC 


PFDOOOSw 


32 


ACCTAATGGCGAAAACCAAG 


ACACTTGCCTTCATCCACTG 


PFDlOOOc 


34 


TGCAACGAAACATTAGCACA 


AGCAGGGGATGATGCTTTAC 


PFD1015C 


35 


AAACACGTTGAATGGCGATA 


GACGCCGAGGAGGTAAATAG 


PFDl235w/MAL7Pl,l 








(SEQ ID NO,: 1) 


36 


TGACGACrCCTCAGACGAAG 


CTCCACTGACGGATCTGTTG 


PFD1245C 


37 


AAGAAAGTGCCACAACATGC 


GTTCGTACGCCTGTCGTTTA 


P,fr£16,40w 


38 


GAAGCTGGTGGTACTGAC6A 


TATTTTCCCACCAGGAGGAG 


PFE0005W 


39 


ATTTGrCGCACATGAAGGAA 


AACTTCGTGCCAATGCTGTA 


MAL6P1.252 


41 


GGTGTCAAGGCAGCTAATGA 


TATGTCCTGCGCTAI IliGC 


PF08_0141 


43 


GTCGTGGAAAAACGAAAGGT 


TATCTATCCAGGGCCCAAAG 


PF08_0142 


44 


ATGTGTGCGAGAAGGTGAAG 


TGCCrrCTAGGTGGCATACA 


MAL6P1.4 


45 


CAAI \ n ICCGACGCTTGTA 


CACATATAGCGCCGTCCTTA 


PF07_0048 


46 


GCGACGCTCAAAAACATTTA 


TCATCCAACGCAATCI 1 IGI 


PF07_0050 


49 


GTTGAGTCTGCGGCAATAGA 


CTGGGGTTTGTrCAACACTG 


PF07_0049 


50 


CACACATGTCCACCACAAGA 


ACCCTTCTGTGGTGTCTTCC 


MAL7P1.56 


51 


ACGTGGTGGAGACGTAAACA 


CCTTTGTTGTTGCCACI \ VG 


MAL7P1.55 


52 


CGTGGTAGTGAAGCACCATC 


CCCACCTTCTTGTGGTTTCT 


PF07_0051 


53 


TGACGACGATAAATGGGAAA 


TTCTTTTGGAGCAGGGAGTT 


PF07_0139 


54 


ACCAAGTGGTGACAAAGCAG 


GGGTGGCACACAAACACTAC 


PFD1005C/PFD1015C 


55 


TTTGTCCGGAAGACGATACA 


ATCTGGGGCAGAATTACCAC 


PF08„0106 
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Pnmer Forward primer 
set 




Taraet aenefs) 




CACACGTGGACCTCAAGAAC 


AAAACCGATGCCAATACTCC 


PFI 1830c 


66 


CCTAAAAAGGACGCAGAAGG 


CCAGCAACACTACCACCAGT 


PF08_0107 


67 


AAGGGAAGATTGGTGGACAG 


AGGGGGATCAGTATCACGTC 


MAL6P1,314 


91 


ACAAAGGAACGTCCATCTCC 


GCCAATACTCCACATGATCG 


PF13_0001 


92 


TGCAAGGGTGCTAATGGTAA 


CCTGCAI rnGACATTCGTC 


PF08„0103 


93 


GACAAATACGGCGACTACGA 


TGTTTCACCCCATTCTTCAA 


MAL6P1.1 


94 


TGGAAAGAACATGGACCTGA 


TTCCTCGAGGGAAGAATCAC 


MAL6P1,316 


95 


TCACAACCTGACCCCCTACT 


TCTTCGTCGTTGTCATCCTC 


PFD0995C 


96 


TGACCAAGACGAAGTATGGAA TTGATCTCTGTTCGCTGTCC 


PFIi820w 


97 


TCATTATGGGAAGCACGATT 


TGATTTCTACCATCGCAAGG 


PFA0015C 


98 


ATGGTGGCAAACTTGTGAGA 


TCCAATTGGTCTCCrTGACA 


PF08_0140 


99 


AGGGAGCATCAGGTGGTAGT 


GCTGTGCATGCI 1 1 1 ICATT 


MAL7P1.50 


100 


GATCAAAAGAGGCGGAGAAG TTCCAATTGGG6AA 1 1 1 UC 


PFIOOOSw 


101 


CAAGAGACACAACCGGAAGA 


CACTTCCAATrGGGGAArnr 


PFA0765C 


60* 


AAGTAGCAGGTCATCGTGGTT TTCGGCACATTCTTCCATAA 


PF07„0073 



*Endogenous control gene: seryhtRNA synthetase {PF07_0073) 



To study gene expression of individual var genes a specific primer set for each of 59 var 
5 genes and one pseudogenes in the 3D7 genome was designed (Table 1), and real-time 
PCR was performed on cDNA from 3D7um and 3D7sm. 

m5-2. Real-time PCR was done using a Rotorgene thermal cycler system 

f htto://www.corbettresearch.com V Change in PFD1235w/MAL7Pl.l (SEQ ID NO-: 

10 1), PF11_0008 (SEQ ID NO.: 3) , and PF13_0003 (SEQ ID NO,: 5) gene 

transcription after selection for antibody recognition. Specific primers targeting SEQ 
ID NO.: 1, SEQ ID NO.: 3, and SEQ ID NO.: 5 (Figs. 13 and 14} were used to 
measure transcription before (3D7u„) and after antibody selection (3D7sm). The fold 
change in transcription levels, normalised against seryl-tRNA synthetase, was 

15 calculated for each primer set by the AACt method (User Bulletin #2, Applied 

Blosystems, http://www,aDDliedblQsystems.com ). Reactions were performed in 
20 nl volumes using QuantiTect SYBR Green PCR master mix and 0.5 mM primers, 
according to manufacturer's instructions ( http://ww w.Qiaaen.coml. PCR cycling 
conditions optimised for P. falciparum cDNA were 95**C for 15 min followed by 40 

20 cycles of 94^C for 30 sec, 54*»C for 40 sec, and es^C for 50 sec with a finai 

extension at 68«*C for 10 min. Data acquisition was done at the end of elongation of 
each cycle. Specificity of amplification was ascertained by melting-curve analysis of 
each PCR product. Electrophoresis of PCR products and EB staining was performed 
and revealed no bands from no-template controls and single bands for all targets in 

25 cDNA PCR products. Quantification was done using the Rotorgene software version 

4,30 ( http://www,qiaaen.com h Transcription levels of the endogenous 
P. fatdparum genes actin, seryhtRNA synthetase and aldolase were analysed in 
order to determine the most accurate endogenous control. P, falciparum seryhtRNA 
synthetase disptayed the most uniform transcription profile in different parasite 



36017DK01 



85 

isolates and an unchanged pattern throughout the parasite life and was thus used 
for calculations of fold changes In var gene transcription by the ACT method 
(described in User Bulletin #2, Applied Biosystems, 
ht-tp://www.appliedh insYstems.com). 

5 

mS.3. Genes {PF11_0521, PFL0030C, PFL1950w, PFLOOSw, PF13_003 (SEQ ID NO.: 5), 
PFB1055C, PFD1235W/MAL7P1.1 (SEQ ID NO.: 1), PFE1640W, PFI1830, 
PF08_0107, MAL6P1.314, MAL6P1.316, PFD0995c, PFAOOlSc, seryl-tRNA 
synthetase) used for absolute quantification were PGR amplified, ligated into the 

10 PCR2.1 TOPO vector and transformed into Eschericla coll TOPOlO celts (TA cloning 

System, Invitrogen). Plasmids were purified using Qiagen Miniprep spin columns 
(Qiagen, Merck Eurolab, Albertslund, Denmark) and the identity of inserts verified 
by subsequent sequencing on an ABI Prism 310 (Perkln-Elmer) using the Big Dye 
terminator reaction mix, ABI Prism proofreading and translation software, and the 

15 P. falciparum 3D7 genome database (http://www.plasmodb.orq). 

Piasmid concentrations were determined by spectrophotometry and serial 10-fold 
Tris-EDTA buffer (pH=7.5) dilutions ranging from 5x10° to SxlO" template copies 
per real-time PGR reaction. Triplicate real-time measurements were made for each 
dilution and a best-fit standard curve was generated using the RotorGene 2000 

20 software. The standard curves were linear across a range of seven logs of DNA 

concentrations with correlation coefficients between 0.9779 and 0.9969. The 
detection limit of the system was 2 20 copies (data not shown). The coefficient of 
variance (CV) was calculated as 100*(standard deviation/mean). RT-PCR was 
performed as described using 1 pg total RNA in a total volume of 40 pi of which 0.5 

25 |Jl was subsequently used for real-time PGR. Absolute values were calculated from 

the standard curves. 



Real-time PGR followed by calculating fold change in 3D7um compared to 3D7sm 
demonstrated marked upregulation of three Group A var genes PFD1235/MAL7P1.1W (SEQ 

30 ID NO.: 1), PF11_0008 (SEQ ID NO.: 3), and PF13_0003 (SEQ ID NO.: 5). The 

transcription of Group B, C or B/C var genes was either downregulated or not regulated in 
the 3D7s„ compared to 3D7um (Figs. 13 and 14, Table 2). The up-regulated var genes 
PFD1235W/MAL7P1.1 (SEQ ID NO.: 1), PF11_0008 (SEQ ID NO.: 3), and PF13_0003 (SEQ 
ID NO.: 5) were also the most dominant var transcript when doing absolute quantification 

35 (Table 2) with var gene-specific primers. The upreguiated and dominant 

PFD1235W/MAL7P1.1 , PF11_0008 (SEQ ID NO.: 3), and PF13_0003 (SEQ ID NO.: 5) gene 
showed 3 to 10-fold higher level of expression following antibody selection both in ring- 
stage and trophozoite/schizont-stage 3D7 parasites (Table 2). 
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In conclusion, IgG selection resulted In marked upregulation ofSEQ ID NO.: h 3, and 5. 
Example 6: 

5 Selection of P. falciparum Isolate 3D7 for adhesion to human endothelial cells in vitro 
results In selective up-regulatlon of Group A var genes 

The particular virulence of R falciparum is due to the ability of infected erythrocytes to 
adhere to a variety of host receptors on the endothelial lining such as ICAM-1, VCAM, 
10 thrombospondin, ELAM-1, and CD36, and avoid splenic clearance. To show that selection 
of 3D7 on human endothelial cells leads to a serological phenotype and a transcription 
profile similar to 3D7smi we did several rounds of selection on bone marrow derived 
endothelial cells generating 3D7endo- 

15 m6-l. We used human endothelial cells to select 3D7 for increased VSAsn expression. 

Human endothelial cells were cultured by standard methodology to measure iRBC 
adhesion to receptors on the endothelial cells. In brief, parasites were radiolabelled 
by incubatijng the cultures overnight in the presence of H-phenylalanin (1 MBq for 
a standard culture containing 200 pi packed RBC). Late-stage-enriched iRBC (100 

20 pi, 1x10^ RBC/ml) were added to the endothelial cell monolayer and incubated for 

one hour at 37°C before unbound iRBC were washed away from the endothelial cell 
monolayer. Finally, the number of endothelial cell-adhering iRBC was determined 
by liquid scintillation spectrometry. 
m6-2, Real-time PCR was done using a Rotorgene thermal cyder system 

25 ( http : //www . corbettresearch . com ^ . Change in PFD1235W/MAL7P1.1 (SEQ ID NO.: 

1), PFll^OOOS (SEQ ID NO.: 3) , and PF13_0003 (SEQ ID NO.: 5) gene 
transcription after selection for antibody recognition. Specific primers targeting SEQ 
ID NO.: 1, 3, and 5 (Fig. 20) were used to measure transcription before (3D7um) 
and after antibody selection {3D7^t^). The fold change in transcription levels, 

30 normalised against seryl-tRNA synthetase, was calculated for each primer set by 

the AACt method (User Bulletin #2, Applied Biosystems, 

http : //www .apoiiedb tQsystems.com ^ > Reactions were performed in 20 \s\ volumes 
using QuantiTect SYBR Green PCR master mix and 0,5 mM primers, according to 
manufacturer's instructions (htt p : //www. qiagen xom ) , PCR cycling conditions 

35 optimised for P. falciparum cDNA were 95^C for 15 min followed by 40 cycles of 

94^C for 30 sec, 54°C for 40 sec, and 68°C for 50 sec with a final extension at 
68*C for 10 min. Data acquisition was done at the end of elongation of each cycle. 
Specificity of amplification was ascertained by melting-curve analysis of each PCR 
product. Electrophoresis of PCR products and EB staining was performed and 

40 revealed no bands from no-template controls and single bands for all targets in 

cDNA PCR products. Quantification was done using the Rotorgene software version 
4.30 ( httD://www.qtaQf!n.com l. R falciparum seryhtRNA synthetase was used for 
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calculations of fold changes in var gene transcription by tlie ACT method (described 
in User Bulletin #2, Applied Biosystems, ht:tp://www>appliedbiosvstems.com). 

The resulting cell line 3D7endo showed a FACS profile similar to that of 3D7smi (Example 3 
5 and Fig. 5) and a marked up-regulation of the PFD1235w/MAL7Pl.l (SEQ ID NO.: 1), 
PFll^OOOB (SEQ ID NO.: 3), and PF13_0003 (SEQ ID NO.: 5) var genes as analysed by 
real-time quantitative PGR (Fig, 20). 

In conclusion, we find that selection of 3D7 for adhesion to human endothelial cells confer 
10 a serological phenotype, VSAsm similar to that of parasites causing severe malaria and to 
3D7 selected with chlldrens plasma. As in 3D7smi the acquisition of the VSAsm phenotype in 
3D7^„^o Is accompanied by marked upregulation of the PFD1235w/f^AL7Pl.l (SEQ ID NO.: 
1), PFl 1^0008 (SEQ ID NO.: 3), and PF13_0003 (SEQ ID NO.: 5) var genes. 

15 Example 7: 

Northern blots of antibody-selected, ring-stage 3D7 showed strong hybridization signals 
corresponding to full-length transcripts of PFD1235w/I^AL7Pl.l (SEQ id no.: i) 

Materials and methods: 

20 

m7-i. For Northern blotting, we used 10 iig of total RNA separated In a standard 

denaturing MOPS-formaldehyde agarose gel and transferred to positively charged 
nylon membranes overnight (Sambrook et aK, 1989). RNA was cross-linked to the 
membrane by baking for 30 min at 120oc. DIG-labeled RNA probes were generated 
25 using the DIG RNA labeling kit (Roche, Hvidovre, Denmark). Hybridization, 

washing, and detection were done according to the manufacturer's 
recommendations with a hybridization temperature of SS^C in DIG Easy Hyb buffer 
(Roche). 

30 Northern blots of antibody-selected, ring-stage 3D7 showed strong hybridization signals 
corresponding to full-length transcripts of the upregulated var gene PFD1235w/MAL7PlA 
(SEQ ID NO.: 1) (Fig.lSA). The corresponding signals were either absent or much weaker 
in Northern blots of unselected 3D7. Similarly, PF11_0008 (SEQ ID NO.: 3) and PF13_0003 
(SEQ ID NO.: 5) were found to be present as full-length transcripts in antibody-selected 

35 3D7, but not in 3D7 (data not shown). 

Taken together, these results shows that antibody-selected 3D7 expressing VSAsM-type 
antigens transcribe full-length PFD1235w/MAL7Pl.J (SEQ ID NO.: 1), PF11_.0008 (SEQ ID 
NO.: 3) and PF13_0003 (SEQ ID NO.: 5)- 

40 



36017DK01 



89 

Example 8: 

The PFD1235W/MAL7P1.1 (SEQ ID NO,: 1), PF11_0008 (SEQ ID NO.: 3), and PF13_0003 
(SEQ ID NO, : 5) genes that is selectively upregufated in P. falciparum isoiate 3D7 following 
selection for antibody recognition in vitro belong to the Group A var genes and share their 
5 characteristics 

From Example 3, 4, 5, and 6 it appears that tlie Group A PFD1235w/MAL7Pl.l (SEQ ID 
NO.: 1), PFll^OOOS (SEQ ID NO.: 3) and PF13^0003 (SEQ ID NO.: 5) genes encodes 
VSAsM-like proteins, and that this protein is involved in the acquisition of the VSAsm 

10 phenotype following antibody-selection of VSAuM-expressing 3D7 (3D7sm)* The fact that 
almost identical results were obtained using plasma pools from children Irving in West 
Africa {pools SMI and SM2) and East Africa (plasma pool SM3) indicate that a PfEMPl 
similar to that encoded by PFD1235w/MAL7Pl,l (SEQ ID NO.: 1), PF11_0008 (SEQ ID 
NO.: 3) and PF13_0003 (SEQ ID NO.: 5) are present in P. falciparum parasites transmitted 

15 on both sides of this vast continent. 

In conclusion, PFD1235w/MAL7PlA (SEQ ID NO.: 1), PF11__0008 (SEQ ID NO.: 3) and 
PF13_0003 (SEQ ID NO.: 5) seems to belong to a sub-group of var genes, with a 
constrained ability to recombine, that are functionally conserved and commonly recognised 
20 by VSAsM-antibodies. 

Example 9: 

PFD1235W/MAL7P1.1 (SEQ ID NO,: 1), PF11„0008 (SEQ ID NO.: 3) and PF13„0003 (SEQ 
ID NO. : 5) belongs to a var sub-family that is common and highly conserved in 
25 P. falciparum isolates 

PFD1235W/MAL7P1.1 (SEQ ID NO.: 1), PF11_0008 (SEQ ID NO.: 3) and PF13_0003 (SEQ 
ID NO.: 5) are the dominant transcript and is highly upregulated in the P. falciparum 
isolate 3D7 following selection for antibody selection and adhesion to human endothelial 

30 ceMs (3D7sM and 3D7endo; Example 3, 4, 5, and 6) . All the 3D7 var genes differ from each 
other, but smaller blocks of sequences with high similarity are found in various var genes. 
To date, only three sub-families of PfEMPl have been defined {varl'3) (Salanti et al. 2002; 
Salanti et al. 2003; Kraemer and Smith, 2003). Apart from the vari-3 sub-families, all 
PfEMPl genes described so far from other parasite isolates differ from each other, and 

35 from the 3D7 var genes. It has therefore been assumed that the global repertoire of var 
genes is very large. This constitutes an obvious obstacle for the development of vaccines 
based on var genes and their products, as a high degree of conservation is a prerequisite 
for vaccine pan-reactivity. 

40 To test the degree of inter-genomic diversity of PFD1235w/MAL7Pl.l{SE<:i ID NO.: 1), 
PFll^OOOS (SEQ ID NO.: 3) and PF13_0003 (SEQ ID NO.: 5) ; 60 different Ghanaian 
parasite isolates obtained from the peripheral blood of P, falciparum malaria patients were 
tested. 
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m9-J.Genomic DNA was isolated (http-//www.clontech.com) using the NucleoSpin 

purification kits according to the manufacturer's recommendations. PGR was carried 
out in 0.2-ml microfuge tubes in a reaction volume of 20 ^1 using a PE2400 PGR 
5 machine (htt p://ww\ Ai pprkin-elmer.com ^. Final concentrations of the PGR reagents 

were as follows: HotstartTaq polymerase (ht;tp://www,qiaqep,cQm): 0.1 
primers: 1 [sM; dNTP: 2.5 mM, each; and MgGb: 1.5 mM). Cycling conditions were 
optimised for P. falciparum DNA: 15 min at 95°C followed by 30 cycles of 30 sec at 
94^C, 30 sec at 50°C, and 4 min at 68° C, with a final extension for 15 min at 68*='C. 

10 The PGR products were visualised and size was determined In a 1% agarose gel 

containing EB. PGR amplification using PFD1235w/MAL7Pl,l, PFll^OOOS, and 
PF13_0003 specific primers (Table 1) on genomic DNA from 60 isolates from 
Ghanaian children yielded a definite band of the expected size of 160 bp In 10, 3, 
20, respectively of the isolates. 

15 m9-2.To demonstrate the extent of sequence similarity, 2304 bp corresponding to 768 
amino acids were cloned and sequenced. Gene-specific primers for 
PFD1235W/MAL7P1.1 (SEQ ID NO.: 1) were used to perform PGR on genomic DNA 
from PGR positive isolates. PGR products were gel-purified using the Qiagen gel 
purification kit according to the manufacturer's instructions 

20 (htt p : //www ■ Qiaoen .com ^ . Purified PGR fragments were ligated into the pCRII TOPO 

vector using TOPO TA cloning kit, and transformed into TOPIO competent cells 
( http://www.invitroaen>com ). Positive clones were selected and propagated. 
Plasmid preparations were made using MIniPrep spin columns 
f http://www.qlaQen.com V Sequencing was performed on an ABI Prism 377 using 

25 the Big Dye terminator reaction mix ( http : //www . nerktn -el m er . co m ) . Proofreading 

and translation were done with ABI Prism software.- 

From the sequencing it was found that 556/768 cloned from BM021 and BM048 were 
Identical to the PFD1235w/MAL7Pl.l (SEQ ID NO.: 1) sequence (Fig. 19). Alignments were 
30 done using GlustalW and default settings. 

Taken together, these data show that PFD1235w/MAL7Pl.l (SEQ ID NO.: 1) belongs to a 
conserved and common gene sub-famiiy and thus fulfils two required criteria for any 
candidate gene in vaccine development. Similarly PF11„0008 (SEQ ID NO.: 3) and 
35 PF13_0003 (SEQ ID NO.: 5) were found to belong to var gene sub-families families with 
similar identities. 

Example 10: 

3D7sMi expresses VARA (SEQ ID NO.: 2) on the surface of infected erythrocytes 

40 

To generate recombinant proteins of VAR4 (SEQ ID NO.: 2) domains were cloned Into the 
pBAD-TOPO vector (Invitrogen) by PGR using the following domain-specific oligonucleotide 
primers and subsequently into the Baculovirus transfer vector pAcGP67: 
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NTS.Fw: 


5' 


-GAATTCATGGGGAATGCATCATCA-3 * 


NTS.Rv: 


5' 


-ATTTCTATCTTCTGCGTT-3 ' 


DBLl-a.Fw: 


5' 


-GAATTCAACGCAGAAGATAGAAATC-3 ' 


DBLl-a,.Rv: 


5' 


-CTTCGATGTATATGTCTG-3 ' 


CIDRl-a.Fw: 


5' 


-GAATTCGACGCTAAAACTGATAGTA-3 ' 


CIDRl-a-Rv: 


5' 


-ACATATATCGCCATTCAACG-S ' 


DBL2-p.Fw: 


5' 


-GAATTCTTGAATGGCGATATATGTA-3 ' 


DBL2-p.Rv: 


5' 


-GTATATAGCTGATACTGT-3 ' 


C2.Fw: 


5' 


-GAATTCACAGTATCAGCTATATACC-3 ' 


C2.Rv: 


5' 


-ATAACATGGTTGTTGTGA-3 ' 


DBL3-p,.Fw: 


5' 


-GAATTCTCACAACAACCATGTTATG-3 ' 


DBL3"p..Rv: 


5' 


-ATCTATTCCACCTGTAGT-3 ' 


DBL4-y.Fw: 


5' 


-GAATTCACTACAGGTGGAATAGATCA-3 ' 


DBL4-y.Rv: 


5' 


-ACATGCGGCATTGAGACT-3 ' 


DBL5-5.Fw: 


5' 


-GAATTCAGTCrCAATGCCGCATGTG-3 ' 


DBL5-5.RV: 


5' 


-TCTACAATGTCTGGCACA-3 ' 


CIDR2-P Fw: 


5' 


-GAATTCTGTGCCAGACATrGTAGATC-3 ' 


CIDR2-p.Rv: 


5' 


-TTTGCCACTAGGTACGT-3 ' 



mlO-1. For production of carboxy-termlnally V5 epitope and histidine-tagged protein the 
PCR 

5 amplified inserts were excised from pBAD-TOPO constructs by EcoRI and Pmel 

digestion and subsequently sub-cloned into the EcoRI and blunt-ended Bgfll sites 
of the Bacuiovirus transfer vector pAcGP67-A (BD Biosciences, Br0ndby, Denmark). 
Recombinant Bacuiovirus were generated by co-transfection of the 
pAcGP307-A-NTS, 

10 pAcGP67-A~DBLl-a, pAcGPSy-A-CIDRlS-a, pAcGP67-A-DBL2-p, pAcGP67-A-C2, 

pAcGP67-A-DBL3-p, pAcGP67-A-DBL4-Y, pAcGP67"A-DBL5-6, or pAcGP67-A-CIDR2-p 
D construct genes and 6sa330I linearized Bakpal<6 Bacuiovirus DNA (BD 
Biosciences Clontech) Into insect Sf9 cells. Recombinant products were expressed 
by infection of insect High-Five cells with recombinant Bacuiovirus using a 

15 multiplicity of Infection of ten. Recombinant proteins were purified from culture 

supernatants on Co2+ metahchelate agarose column and eluted with 25 mM 
HEPES-KOH (pH==7.30), 0.5 mM MgCij, 0.5 mM DTT, 100 mM NaCI, 10% 
glycerol, and 100 mM imidazole. 

20 mlO'2. For production of GST fusion protein a conserved sequence of the intracellular 
acidic 

segment ATS/Exon2 was sub-cloned Into the pGEX-4Tl vector by PCR using the 
following oligonucleotide primers ATS-Fw: 
5'-£GSAATrCAAAACAAAATCATCAGTAG-3', 
25 ATS-Rv: 5^- 4r^^g>MrGCggCCGC GTTGATrACCACTTAATGTG'-3\ The proteins were 

expressed as fusion proteins at the carboxy-terminus of glutathione S-transferase 
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from Schistosoma japon/cum and purified by affinity chromatography on 

glutathione sepharose 4B (Amersham Pharmacia Biotech). 
mlO'S.To generate murine antibodies against VAR4 domains and ATS/Exon2 the 
recombinant 

5 proteins were used to immunize Balb/c mice (5 pg) and rabbit s {50 pg) (given 

subcutaneously in Freund's complete adjuvant) followed by two booster Injections 
In Freund's incomplete adjuvant. 

A DNA vaccination approach to generate antibodies to VAR4 (SEQ ID NO.: 2) domains 
10 were also used. All domains was cloned into the Eucaryotic TA expression vector pCR3.1 
(Invitrogen) using the following primers; 

DBLl-a, Fw : 5 ' -GCCRCCATGGACGCAGAAGATAGAAATC-3 ' 
DBLl-a..Rv: 5 ' -CTACrTCGATGTATATGTCT-3 ' 
CIDRl-a,Fw: 5 ' -GCCRCCATGGACGCTAAAACTGATAGTA-3 ' 
CIDRl-a.Rv: 5 ' -CTAACATATATCGCCATTCAACG-3 ' 
DBL2-p,Fw: 5 ' -GCCRCCATGGTGAATGGCGATATATGTA-3 ' 
DBL2-P.RV : 5 ' -CTAGTATATAGCTGATACTGT-3 ' 
C2.FW: 5 ' -GCCRCCATGGCAGTATCAGCTATATACC-3 ' 

C2.Rv: 5 ' -CTAATAACATGGTTGTTGTGA-3 ' 

DBL3-p..Fw: 5 ' -GCCRCCATGGCACAACAACCATGTrATG-3 ' 
DBL3-p,.Rv: 5 ' -CTAATCTATTCCACCTGTAGT-3 ' 
DBL4-y.Fw : 5 ' -GCCRCCATGGCTACAGGTGG AATAG ATCA-3 ' 
DBL4-y,Rv: 5 ' -CTAACATGCGGCATTG AG ACT-3 ' 
DBL5-5.FW: 5 '-GCCRCCATGGGTCTCAATGCCGCATGTG-3 ' 
DBL5-6.RV : 5 ' -CrATCTACAATGTCTGGCACA-3 ' 
CIDR2-P.FW : 5 ' -GCCRCCATGGGTGCCAG ACATTGTA6ATC-3 ' 
CIDR2-p.Rv: 5 ' -CTATTTGCCACTAGGTACGT-3 ' 

mlO-4. Plasmids were propagated in TOPIC cells (Invitrogen) and plasmid was purified 
15 using Plasmid GIGA prep kit (QIagen). Plasmid DNA was injected IM to mice 4 

times with 2 weeks intervals and frnally boosted with the recombinant protein 
corresponding to the domain. 
mlO'5. SDS-extracted trophozoite/schizont-stage iRBC were reduced by boiling in the 

presence of L-mercaptoethanol and electrophoresed in the Laemmll sodium dodecyl 
20 sulfate polyacrylamide gel electrophoresis (SDS-PAGE) system (Sambrook et al., 

1989). Immunoblots were prepared on PVDF membranes by semidry blotting using 
standard methods. All available binding sites on membranes were blocked In Tris- 
buffered-saline-Tween (TBS-f ) containing 5% skimmed milk. Blots were probed 
with an antiserum raised against a recombinant protein of the highly conserved 
25 intracellular acidic segment ATS/Exon2 conserved among most var genes in 3D7, 

D^L5-b of PFD1235w/MAL7Pl,l (SEQ ID NO.: 1), and pre-immune mouse sera 
diluted 1:100 in TBS-T. Bound antibody was detected with relevant IgG alkaline 
phosphatase-conjugated antibody (Dako, Glostrup, Denmark). All washes were 
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done using TBS-T. Blots were developed by standard methods using p-nitroblue 
tetrazolium phosphate (NBT, Sigma) and 5-bromo-4-chloro-3-indolylphosphate 
(BOLP, Sigma) as substrates. 
mlO-6. Immunostaining and flow cytometry were performed as described (Staalsoe et al., 

5 1999; Nielsen et al., 2002) with some modifications. Briefly, 2.5 x 10^ of MACS 

purified, ethidlum bromide labeled IRBC were incubated for 1 hr in 20 \i\ of murine 
sera or for 30 min in 5 uL of human sera. All murine sera had been depleted of 
anti-human erythrocyte antibodies by absorption. For immunostaining with murine 
sera, iRBC were sequentially exposed to 100 iiL of 1:25 diluted goat anti-mouse Ig 

10 (Dako), biotinylated anti-goat Ig (Dako) and 1:200 diluted FITC-conjugated 

streptavldin (Dako) for 30 minutes each. For immunostaining with human sera, 
IRBC were incubated In 100 uL of 1:25 diluted biotinylated anti-human IgG (Dako) 
then in 1:2000 diluted FITC-conjugated streptavldin for 30 min each time. For 
fluorescence and confocal microscopy, wet mounts of immunostalned parasites 

15 were prepared and images were obtained using a Leica DM LB2 and a Carl Zeiss 

Scanning Microscope, respectively. For both microscopes, suitable filters and 
channels were used to detect FITC and ethidium-bromide staining. 
When appropriate, trypsin treatment was performed as described (Fernandez et al., 
1999) with modifications. Cells were washed once in PBS, incubated in 10 volumes 

20 of 100 Mg/ml TPCK-treated trypsin (Amersham Pharmacia Biotech) in PBS for 10 

min at 37°C. The reaction was stopped with 1 volume of 2 mg/ml soybean trypsin 
inhibitor (Sigma-Aldrich) in RPMI 1640/5% Albumax. Cells were washed twice in 
PBS then used for Immunostaining and flow cytometry. 

25 We next used Western blotting to investigate the protein translation of VAR4 (SEQ ID NO.: 
2) {mlO-5). Analysis of PfEMPl expression of unsetected late trophozolte/schlzont stage 
3D7 using andserum against the conseived Intracellular ATS/Exon2 var domain showed a 
single band with an estimated molecular weight of 260 kDa {Fig. 17, Lane A). This 
indicates that most of these parasites expressed a PfEMPl with a four-domain structure, in 

30 correspondence to the results of the absolute quantification (Example 5, Table 2) that 
showed a dominant group C gene transcript encoding a PfEMPl molecule of this size. 
Antibody-selected 3D7 expressed a PfEMPl species of 260 kDa, but also expressed 
additional high-molecular weight bands (Fig. 17, Lane B). Again, these data agree with the 
results of the absolute quantification showing that the three most highly transcribed var 

35 genes in antibody-selected 3D7 were PF08_0107, MAL6P1.316, and more specifically 
PFD1235W/MAL7P1.1 (SEQ ID NO.: 1), which encode proteins with predicted molecular 
weights of approximately 260, 330, and 400 kDa, respectively. Furthermore, an antiserum 
against Bacutovirus-derived DBL5-S of the PFD1235w/MAL7Pl.l (SEQ ID NO.: 1) gene 
product revealed a high-molecular weight band (Fig. 17, Lane D) corresponding to the top 

40 band In the ATS/Exon2-probed blot (Fig. 17, Lane B). This indicates that 

PFD1235W/MAL7P1.1 (SEQ ID NO.: 1) is not only a major transcript in antibody-selected 
3D7, but that this var gene is also translated into protein. 



36017DK01 



94 

Antibodies to Baculovirus-derived DBL5-6 of the PFD1235w/MAL7Pl,l {SEQ ID NO.: 1) 
gene product were tested on IRBC by flow cytometry, immuno-fluorescence and confocal 
microscopy im9-6). Only a small percentage of RBC infected by unselected 3D7 reacted 
with the murine DBL5-5 antibodies {Fig. 18A1), although these iRBC did express V5A 
5 detectable by flow cytometry with human plasma (Fig. 18A3). By contrast, a large 
proportion of RBC infected with antibody-selected 3D7 were recognized by the murine 
antibodies, and this reactivity was abrogated by prior trypsin treatment of the iRBC (Fig. 
18B1). Fluorescence and confocal microscopy using the DBL5-6 murine antibodies showed 
a distinct punctate pattern on the surface of intact RBC infected by antibody-selected 3D7 
10 (Figs. 1882, 18B5), but not by unselected 3D7 (Figs. 18A2, 18A5). 

Taken together, these results mdicate that the product of PFD1235w/MAL7Pl.l (SEQ ID 
NO.: 1 and 2) is expressed on the surface of antibodyseiected 3D7 and that the protein 
confers a VSAsm phenotype. Similar results were obtained for PF11_0008 (SEQ ID NO.: 3 
15 and 4) and PF13_0003 (SEQ ID NO,: 5 and 6) (data not shown). 

Example 11: 

Monoclonal antibodies to VARA (SEQ ID NO.: 2), VAR5 (SEQ ID NO.: 4), VAR6 (SEQ ID 
NO. : 6) 

20 

For production of monoclonal antibodies (mAb) mice were immunized by injection of 
different domains of VAR4, VAR5, and/or VAR6 to stimulate the production of antibodies 
targeting these domains. 

25 mll'l. The antibody forming cells were isolated from the spleen of the mice. Monoclonal 
antibodies were produced by fusing single antibody-forming ceils to cancer cells 
(such as cells from myeloma) to make them immortal. The cells were grown using 
in vitro cell-culture techniques and cloned by limiting ditutfon. mAb secreted from 
the hybrldomas were purified and used for identification by flow-cytometry of field 

30 P. falciparum isolates expressing VAR4, VAR5, and/or VAR6 or homologous hereof. 

This strategy resulted in mAb with different specificities and different avidity and the 
resulting mAb were used to identify epitopes involved in adhesion to receptors on human 
endothelial cells and to discriminate between even quite similar epitopes. 

35 

Example 12: 

VAR4 (SEQ ID NO.: 2) recombinant fusion proteins are commonly and specifically 
recognised 

40 If VAR4 (SEQ ID NO 2) mediates a VSAsm phenotype, it would be predicted that a high 
proportion of children in malaria endemic areas have acquired antibodies to this protein. 
To test this hypothesis we measured plasma levels of IgG against recombinant domains of 
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the VAR4 protein by EUSA \n asymptomatic individuals living under high malaria 
transmission intensity in Tanzania. 

To make recombinant proteins of VAR4 (SEQ ID NO.: 2) the PFD1235w/MAL7Pl.l {SEQ ID 
5 NO.: 1) NTS, DBLl-a, CIDRl-a, DBL2-p,C2, DBL3-p, DBL4-y, DBL5-5, and CIDR2-p were 
sub-cloned into the pGEX-4Tl vector by PCR using the following domain-specific 
oligonucleotide primers and a hot start taq polymerase (Qiagen) and PfuTurbo 
(Stratagene): 

NTS.Fw: 5 * -GAATTCATGGGGAATGCATCATCA-3 ' 
NTS . Rv: 5 ' -ATAAG AATGCGGCCGCATTTCTATCTTCTGCGTT-3 ' 
DBLl-a.Fw: 5 ' -GAATTCAACGCAGAAGATAGAAATC-3 ' 
DBLl-a,.Rv: 5 ' -ATAAGAATGCGGCCGCCTrCGATGTATATGTCTG-3 ' 
aORl-a.Fw: 5 ' -GAATrCGACGCTAAAACTGATAGTA-3 ' 
aORl-a.Rv: 5 '-ATAAGAATGCGGCCGCACATATATCGCCATTCAACG-B ' 
DBL2-p.Fw: 5 ' -GAATTCTTGAATGGCGATATATGTA-3 ' 
DBL2-p.Rv: 5 '-ATAAGAATGGGGGCGCGTATATAGCTGATACTGT-3 ' 
C2.Fw: 5 ' -GAATTCACAGTATCAGCTATATACC-3 ' 

C2. Rv : 5 ' -ATAAGAATGCGGCCGCATAAC ATGGTTGTTGTGA-3 ' 

DBL3- p., Fw : 5 ' -GAATTCTCACAACAACC ATGTTATG- 3 ' 
DBL3-p..Rv: 5 ' -ATAAGAATGCGGCCGCATCTATrCCACCTGTAGT -3 ' 
DBL4-y.Fw : 5 ' -G AATTCACTACAGGTGGAATAGATCA-3 ' 
DBL4-Y.RV : 5 ' - ATAAGAATGCGGGCGCAC ATGCGGCATTGAGACT-3 ' 
DBL5-6.FW : 5 ' -GAATTCAGTCTCAATGCCGCATGTG-3 ' 
DBL5-6.Rv: 5 ' -ATAAGAATGCGGCCGCTCTACAATGTCTGGCACA-3 ' 
CIDR2-p.Fw: 5 ' -TCCCCCGGGTGTGCCAGACATTGTAGATG-3 ' 
CIDR2-p.Rv: 5 ' -ATAAGAATGCGGGCGCTTTGCCACTAGGTAGGT-3 ' 
10 The proteins encoding single domains were expressed as fusion proteins (E. co// stram 
BL21) at the ca rboxyterminus of glutathione S-transferase from Schistosoma Japonicum, 
and purified by affinity chromatography on glutathione sepharose 4B (Amersham 
Pharmacia Biotech) 

15 Materiais and methods 

mlZ'l, We used plasma samples from 20 children (3-4 and 10-11 years of age) and 10 
young adults (18-19 years of age) living in Mgome village in the Tanga region of 
Tanzania for ELISA analysis of antibody responses to punfied recombinant NTS, 

20 DBL3-P, CIDRl-a and DBL5-5 of VAR4-GST domains as previously described. 

Briefly, proteins were diluted in 0.1 M glycine/HCI (pH 2.75). The wells of Maxisorp 
micro titre plates (Nunc, Rosktlde, Denmark) were coated with antigen by overnight 
Incubation at 4«C. The plates were emptied, and any residual binding capacity was 
blocked with 100 ^il of blocking buffer (1% bovine serum albumin, 0.5 M NaCI, 1% 

25 Triton-X-lOO in phosphate-buffered saline (PBS), pH 7.2) per well. After Incubation 

for 0.5 hr at room temperature, the plates were washed four times with washing 
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buffer (PBS, 0.5 M NaCl, 1% Triton-X-100, pH 7.4) and 100 ^il of plasma diluted 
1:200 in blocking buffer was added to each well. The plates were then incubated 
for one hour at room temperature, and then washed and incubated for one more 
hour at room temperature with peroxidase-conjugated rabbit anthhuman 
5 immunoglobulin G (IgG) (Dako, Glostrup, Denmark) diluted 1:1000 in blocking 

buffer. Subsequently, the plates were washed and 100 jil of o-phenylenedlamine 
substrate (0,30%, Dako) diluted in 0.1 M sodium citrate buffer (pH 5.0) with 
0.05% (v/v) H2O2, was added to each well. Finally, the plates were incubated at 
room temperature in the dark before the addition of 100 ^il of 2.5 M H2SO4 and the 
10 optical density (OD) was measured at 492 nm. Control plates were coated with GST 

alone* Cut-off values were calculated as the mean ELISA unit plus 2 standard 
deviations obtained with sera of 13 Danish blood donors without malaria exposure 
as described in (Jakobsen et al., 1993). 

15 ml2'2. Competition ELISA was done using recombinant CIDRl-a domains of 

VAR4 {SEQ ID NO.: 2) and varl {PFE1640w) using three different plasma samp!es 
of high, medium, and low reactivity with the proteins from two children aged 3 
years and one child aged 11 years. Blocking of plasma was done using 0.1, 1, 5, 
and 10 \xg/m\ of recombinant protein for 2 hr at room temperature. The test 

20 plasma samples were diluted 1:50 and tested as described above. 

Most parasite-exposed children and adults had IgG directed against DBL5-6 (Fig. 18A), 
CIDRl-a (Fig. 18B), DBL3-p(Fig. 18C), and NTS (Fig. 18D), with comparable levels in 
young children and adults. To test the antigen specificity of the antibody recognition, we 
25 performed competition EUSA using the CIDRl-a domains of the VAR4 (SEQ ID NO.: 2) 

and varl {PFE1640w) proteins. Pre- incubation of the plasma with homologous recombinant 
protein caused a dose-dependent reduction in OD492 values (Figs. 18E, 18F}, whereas the 
heterologous protein (Figs. 18E, 18F) did not change the plasma reactivity. 

30 These result showing that a high proportion of young children recognises recombinant 
domains of VAR4 and the demonstration that our DBL5-5 antiserum did not cross react 
with products of MAL6P1316, and PF08_0107 in Western blotting, suggest that the plasma 
antibody reactivity to recombinant VAR4 (SEQ ID NO.: 2) protein was the result of 
exposure to parasites expressing PfEMPl resembling that encoded by PFD1235w/MAL7Pi.l 

35 (SEQ ID NO.: 1), rather than being due to a broad cross-reactivity between different CIDR 
or DEL domains. Similar results were obtained when using VAR5 (SEQ ID NO.: 4) and 
VAR6 (SEQ ID NO.: 6) (data not shown). 

Example 13: 

40 Experimental infection of humans results in transcription and translation of 
PFD1235W/MAL7P1.1 (SEQ ID NO.: 1 and 2) 
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We have proposed that the dominance of parasites expressing VSAsw-type antigens among 
non-immLine patients is related to their higher growth rate in such individuals, and that the 
shift towards VSAuM-type antigens occurs as this strong selective advantage of VSAsm" 
expressing parasites gradually disappears as VSAsM-specific immunity is acquired. 

5 

Materials and methods: 

ml3'l. We used parasites isolated on days 8, 9, and 10 from a Dutch volunteer exposed 
on day 0 to mosquitoes infected by P. faldparum isolate NF54 as part of ongoing 
10 studies of experimental P. faldparum infections (Hermsen et al., 2001). These 

parasites were cultured In vitro for 27 (day 8 and day 9 isolates) or 33 days 
(day 10 isolate) to obtain sufficient parasites for DNA/RNA analysis. 

ml3-2. SDS-extracted trophozoite/schizont-stage iRBC were reduced by boiling in the 
15 presence of L-mercaptoethanoI and electrophoresed in the Laemmli sodium dodecyl 

sulfate polyacrylamide gel electrophoresis (SDS-PAGE) system (Sambrook et al., 
1989). Immunoblots were prepared on PVDF membranes by semidry blotting using 
standard methods. All available binding sites on membranes were blociced In Tris- 
buffered-sallne-Tween (TBS-T) containing 5% skimmed milk. Blots were probed 
20 with an antiserum raised against a recombinant protein of the highly conserved 

intracellular acidic segment ATS/Exon2 conserved among most var genes in 3D7, 
DBL5-6of VAR4 (SEQ ID NO.: 1), and pre-immune mouse sera diluted 1:100 in 
TBS-T. Bound antibody was detected with relevant IgG alkaline phosphatase- 
conjugated antibody (Dako, Glostrup, Denmark). All washes were done using TBS- 
25 T. Blots were developed by standard methods using p-nitrobiue tetrazolium 

phosphate (NBT, Sigma) and 5-bromo-4-chloro-3-indolylphosphate {BCIP, Sigma) 
as substrates. 

To further study this hypothesis we studied parasites rescued on days 8, 9, and 10 from a 
30 Dutch volunteer receiving a mosquito-transmitted P. faldparum NF54 infection on day 0. 
NF54 was originally isolated from a non-immune Dutch malaria patient, and isogenic with 
the 3D7 parasites cloned from it ( Salanti et al., 2003). Western blots of Day 10 parasites 
probed with antiserum against the DBL5-6 domain of the PFD1235w/MAL7PlA gene 
product, detected a high-molecular weight band (Rg. 17, Lane G), which corresponded in 
35 size to that observed in antibody-selected 3D7 (Fig. 17, Lane B). This band was not 

detected in similar blots of parasites obtained on Day 8 and Day 9 (Fig. 17, Lanes E-F), In 
line with these findings, we observed high PFD1235w/MAL7Pl,l (SEQ ID NO.: 1) copy 
numbers relative to those of the ser/f tRNA synthetase gene on Day 10, but not on Day 8, 
corresponding to a 38.30 and 8.4 fold increase in mRNA copies of this v^argene relative to 
40 the copy number of seryl-tRNA synthetase mRNA between Day 8 and Day 10 in rings and 
trophozoites, respectively (data not shown). Judged by comparison of Ct-values, ring- 
stage transcription of PFD1235w/MAL7Pl.l (SEQ ID NO.: 1) was relatively low on Day 8 
(rank: 55/60), but high on Day 10 (rank: 9/60), The corresponding rankings in 
trophozoites were 48 on Day 8 and 18 on Day 10. 
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Taken together, these data show that PFD1235w/MAL7Pl.l (SEQ ID NO.: 1) is translated 
into a functional protein in vivo, and indicate that this protein may be associated with fast- 
growing parasites dominating acute P. falciparum malaria in non-immune patients- Similar 
5 results were obtained for PFl 1^0008 (SEQ ID NO.: 3) and PF13_0003 (SEQ ID NO.: 5) 
(data not shown). 

Example 14: 

10 Anti-adhesion assay 

It \$ becoming increasingly apparent that acquired protective immunity to P, falciparum 
infection relies on antibodies (Abs) specifically recognizing variant parasite antigens 
expressed on the surface of late stage-infected erythrocytes. In this scenario, only 
parasites expressing variant antigens to which the host does not possess adequate specific 

15 Ab are lii<ely to cause disease, and immunity Is iikely to depend on the accumulation of a 
large panel of Ab specificities recognizing different variants of such antigens. Severe 
malaria is often associated with sequestration of large quantities of parasites in the brain. 
Parasites causing severe malaria have been shown to adhere receptors on endothelial 
cells. To show that VAR4, VAR5, and VAR6 are responsible for in vitro adhesion of 3D7swi 

20 and 3D7endo parasites to endothelial receptors, an antibody adhesion assay with murine 
antibodies against VAR4, VAR5, and VAR6 were performed. 

Materials and methods: 

ml4'l. Antiadhesion was measured by labeled parasites: For use in adhesion assays, 

25 parasite cultures with a parasitemia of'^1% late trophozoites and schizonts were 

first transferred from Albumax II medium (Life Technologies), with a high 
concentration of hypoxanthine (Hpx), into RPMI 1640 plus 5% normal human 
serum (low Hpx) and maintained for 24 h. The parasites then were labeled by 
exposure to [3H]Hpx (Amersham; 8.75 MBq/mL of RBCs) for another 24 b. Finally, 

30 the cultures were enriched for late-stage iRBCs and incubated for 30 min, with or 

without test plasma. Endothelial cells were grown on Micrptiter plates (Falcon; 
Becton Dickinson) then blocked with bovine serum albumin (BSA; 20 mg/mL, 100 
lLiL/well) in PBS at room temperature for 30 min. We added enriched ['H]Hpx- 
labeled late-stage iRBCs to wells containing endothelial cells (2 x 10^ cells/well) 

35 and incubated the wells at 37°C for 1 h. Nonadherent iRBCs were removed by 4 

washes in RPMI 1640. Adherent iRBCs were harvested onto glass fiber pads, and 
the [^H]Hpx activity was measured in a liquid scintillation counter (Beckman 
Coulter). Inhibition of IRBC adhesion by plasma was calculated as 1 - 
(testEndothelialCell - controlBSA)/controlEndothenalCell - controlBSA), where 

40 testEndothelial is counts per minute of iRBCs preincubated with plasma and 

adhering to wells containing endothelial cells, and controlEnothelial cells and 
controlBSA refer to counts per minute of iRBCs not preincubated with plasma and 
adhering to endothelial cells and BSA-coated wells, respectively- 
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Cytoadhesion of endothelial cells were significantly inhibited by plasma from individuals 
suffering from severe malaria, and more importantly binding of 3D7smi and 3D7endo to 
endothelial cells were strongly inhibited by the murine anti-VAR4, -VAR5, and ~VAR6 
5 antibodiesAntibodies raised against recombinant VAR4, -VAR5, and -VAR6 inhibit parasite 
adhesion to endothelial cells in vitro (data not shown). 

An obvious consequence of this finding is ttiat vaccine induced antibodies against SEQ ID 
NO. : 2, 4, and/or 6 constructs can hinder binding of parasites to endotheiiai tissue and 
10 thus prevent severe maiaria. 

Example 15: 

Identification of receptors and VAR4 (SEQ ID NO.: 2), VAR5 (SEQ ID NO,: 4), and/or VAR6 
(SEQ ID NO,: 6) sites binding receptors on endotheiiai ceiis in vitro 
15 To further study endothelial cell adhesion all domains were cloned Into the pDISPLAY 
vector (Invitrogen) using the following primers as examplified for VAR4: 

NTS. Fw : 5 ' -TCCCCCGGGATGGGGAATGCATCATCA-3 ' 
NTS. Rv: 5 ' -TCCCCGCGGATrrCTATCTTCTGCGTT-3 ' 
DBLl-a. Fw : 5 ' -TCCCCCGGGAACGCAGAAGATAGAAATC-3 ' 
DBLl-a..Rv: 5 ' -TCCCCGCGGCTTCGATGTATATGTCT-3 ' 
CIDRl-a.Fw: 5 '-TCCCCCGGGGACGCTAAAACTGATAGTA-3 ' 
CIDRl-a.Rv: 5 ' -TCCCCGCGG ACATATATCGCCATTCAACG-B ' 
DBL2-p.Fw: 5 ' -TCCCCCGGGTTGAATGGC6ATATATGTA-3 ' 
DBL2-P-RV: 5 ' -TCCCCGCGGGTATATAGCTGATACTGT-3 ' 
C2 . Fw : 5 ' -TCCCCCGGGACAGTATCAGCTATATACC-3 ' 

C2. Rv : 5 ' -TCCCCGCGGATAACATGGTTGTTGTGA-3 ' 

DBL3-p..Fw : 5 ' -TCCCCCGGGTCACAACAACCATGTTATG-3 ' 
DBL3-p„Rv : 5 ' -TCCCCGCGGATCTATTCCACCTGTAGT-3 ' 
DBL4-y.Fw: 5 ' -TCCCCCGGGACTACAGGTGGAATAGATCA-3 ' 
DBL4-y.Rv: 5 ' -TCCCCGCGGACATGCGGCATTGAGACT-B ' 
DBL5-5.FW: 5 ' -TCCCCCGGGAGTCTCAATGCCGCATGTG-3 ' 
DBL5-5.Rv: 5 ' -TCCCCGCGGTCTACAATGTCTGGCACA-3 ' 
CIDR2-p.Fw: 5 '-TCCCCCGGGTGTGCCAGACATTGTAGATC-3 ' 
CIDR2-p.Rv: 5 ' -TCCCCGCGGTTTGCCACTAGGTACGT-B ' 

The ability of the different domains to bind directly to endothelial cells was in this example 
20 assayed using a mammalian expression system. 

ml5-l. Domains were cloned into the pDisplay vector (Invitrogen). This vector allows 
display of cloned proteins on the cell surface. Each domain was fused at the N- 
terminus to the murine Ig K-chain leader sequence, which targets the protein to the 
25 cell surface, and at the C-termmus to the platelet derived growth factor receptor 

(PDGFR) transmembrane domain, which anchors the protein to the cell membrane. 
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A human non-adherent T cell and a CHO cell line was used for transient expression of the 
recombinant proteins. This approach has enabled us to study cell adhesion to endothelial 
cells. Blockage of this binding was studied using relevant recombinant domains expressed 
5 in the Baculovirus system (Example 10) and commercial antibodies directed against 
specific endothelial receptors- 

VAR4 (SEQ ID NO.: 2), VAR5 (SEQ ID NO.: 4), and/or VAR6 (SEQ ID NO.: 6) regions 
responsible for receptor binding were identified using random mutagenesis. 

10 Example 16: 

To express VAR4 (SEQ ID NO.: 2), VAR5 (SEQ ID NO,: 4), and/or VAR6 (SEQ ID NO,: 6) In 
eucaryotic organisms the Exon 1 ranging from nt 1 to 9444 (SEQ ID NO.: 1), ^719 (SEQ 
ID NO.: 3), and 8844 (SEQ ID NO.: 5) were subjected to a full recodonisatfon: 

15 An artificial codon table was generated by combining the codon usage of Tricfioplusia ni 
and Homo sapiens genes. The codon bias of the synthetic genes were adapted to this 
"artificial" codon usage. In addition, regions of very high (> 80%) or very low (<30%) GC 
content was avoided and the GC-content was adjusted to 50% where possible. During the 
optimization process following cIs-acting sequence motifs were avoided: internal TATA- 

20 boxes, chi"Sites and ribosomal entry sites, AT-rich or GC-rich sequence stretches, repeat 
sequences and RNA secondary structures, and 
(cryptic) splice donor and acceptor sites, branch points 

No reverse-complementary sequence identities longer than 20 nucleotides are found when 
25 the optimized sequence is aligned to the transcriptom of Homo sapiens. No RNA 
interference should therefore be expected. The entire gene was divided into and 
constructed as four -2kb long fragments using PstI (2028), KasI (3759) and PvuII (5899) 
and cloned into pCR-Script-Amp (Stratagene, CA, USA) Kpnl and Sad restriction sites. 

30 The recodonised PFD1235w/MAL7Pl.l, PFll^OQOSr and PF13_0003 NTS, 
DBLl-a, CIDRl-ct, CIDRl-y, DBL2-p, DBL2-y, C2, 

DBL3-P, DBL3-y, DBL4-y, DBL4-5, DBL4-p, DBL5-5, DBL5-p, and CIDR2p were expressed in 
Baculovirus infected hi-five insect cells and purified by HIS tag Metal Chelate Afl^lnlty 
Chromatography purification by cloning the domains into the pAcGP307-A (BD Biosciences, 
35 Br0ndby, Denmark) and Eucaryotic TA expression vector pCR3.1 (Invitrogen). 

Example 17: 

Field Isolates of P. falciparum causing severe ma/aria shows high transcription of 
40 PFD1235W/MAL7P1.1 (SEQ ID NO. 1), PF11_0008 (SEQ ID NO.: 3), and/or PF13_0003 
(SEQ ID NO.: 5) or homologues hereof. 
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From Example 3, 4, 5, and 6 it appears that the Group A PFD1235w/MAL7Pl.l (SEQ ID 
NO. 1), PFl 1^0008 (SEQ ID NO.: 3), and/or PF13_0003 (SEQ ID 5) gene encodes VSAsm- 
like proteins, and that these proteins are Involved in the acquisition of the VSAsm 
phenotype following antibody-selection of VSAuw-expressing 3D7 (3D75m) and adhesion to 
5 endothelial cells {3D7endo)- 

Materials and methods 

mi 7.1. Real-time PGR was done using a Rotorgene thermal cyder system 

( h ttp : // w w w . corb ettresea rch.com > following purification of RNA and synthesis of 

10 cDNA (mS.I). The transcription level of PFD1235w/MAL7Pl.l (SEQ ID NO. 1), 

PF11_0008 (SEQ ID NO.: 3), and PF13_0003 (SEQ ID NO.: 5) or homologues of 
hereof yNBre measured using primers targeting this sequence. The absolute copy 
number was calculated {m5,3) and compared to the absolute copy number of other 
var genes present in the field isolates as well as housekeeping genes such as seryl- 

15 tRNA synthetase. Reactions were performed as In (m5.2) 

The transcription of PFD1235w/MAL7Pl.l (SEQ ID NO, 1), PF 11^0008 (SEQ ID NO,: 3), 
and/or PF13_0003 (SEQ ID NO.: 5) or homologues hereof were found to be higher than 
other var genes in parasites causing severe malaria, but low in parasites causing 
20 uncomplicated malaria (data not shown). 

Example 18: 

Plasmodium falciparum field isolates causing severe malaria expresses VAR4 (SEQ ID NO.: 
2) on the surface of infected erythrocytes 

25 

Analysis of VAR4 (SEQ ID NO.: 2) expression of RBC infected with parasites causing severe 
malaria usmg antibodies to Baculovirus-derived DBL5-6 of the PFD1235w/MAL7Pl.l gene 
product revealed a high-molecular weight band of approximately 400 kDA corresponding to 
the size of the VAR4 expressed by 3D7sm (Example 5), which was absent m parasites with 
30 a VSAuM phenotype (data not shown). 

Taken together, the results of Example 17 and 18 indicates that PFD1235w/MAL7Pl.l 
(SEQ ID NO.: 1) or homologues hereof are not oniy transcribed, but also expressed on the 
surface of RBC Infected with parasites having a VSAsm phenotype and causing severe 
35 malaria, but not parasites causing uncompiicated mafaria and having a VSAum phenotype. 
Similar results were obtained for PFl 1^0008 (SEQ ID NO.: 3), and PFl 3^0003 (SEQ ID 
NO.: 5) (data not shown). 

Example 19: 

40 Plasmodium falciparum field isolates causing severe malaria adhere to endothelial cells and 
adhesion is inhibited by SEQ ID NO. 2, 4, and/or 6 specific antibodies 
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To show that VAR4, VAR5, and/or VAR6 are responsible for in vitro adhesion of field 
isolates to endothelial receptors, an antibody adhesion assay with nnurine antibodies 
against VAR4, VAR5, and VAR6 were performed. 

5 Materials and methods: 

mlS'l. Antiadhesion was measured by labeled parasites: For use in adhesion assays, 
parasite cultures with a parasitemia of ^1% late trophozoites and schizonts were 
first transferred from Albumax II medium (Life Technologies), with a high 
concentration of hypoxanthine (Hpx), into RPMI 1640 plus 5% normal human 

10 serum (low Hpx) and maintained for 24 h. The parasites then were labeled by 

exposure to [3H]Hpx (Amersham; 8.75 MBq/mL of RBCs) for another 24 h. Finally, 
the cultures were enriched for late-stage iRBCs and incubated for 30 min, with or 
without test pFasma. Endothelial cells were grown on Microtiter plates (Falcon; 
Becton Dickinson) then blocked with bovine serum albumin (BSA; 20 mg/mL, 100 

15 nL/well) In PBS at room temperature for 30 mln. We added enriched [^H]Hpx- 

labeled late-stage iRBCs to wells containing endothelial cells (2 x 10^ cells/well) 
and incubated the wells at 37°C for 1 h. Nonadherent iRBCs were removed by 4 
washes in RPMI 1640. Adherent iRBCs were harvested onto glass fiber pads, and 
the [^H]Hpx activity was measured in a liquid scintillation counter (Beckman 

20 Coulter). Inhibition of iRBC adhesion by plasma was calculated as 1 - 

(testEndothelialCell - controlBSA)/controlEndothelialCell - controlBSA), where 
testEndothelial is counts per minute of iRBCs preincubated with plasma and 
adhering to wells containing endothelial cells, and control Enothetial cells and 
controlBSA refer to counts per minute of iRBCs not preincubated with plasma and 

25 adhering to endothelial cells and BSA-coated welis, respectively. 

Cytoadhesion to endothelial cells were significantly inhibited by plasma from young semi- 
immune children, and more importantly binding of field Isolates to endothelial cells were 
strongly Inhibited by the murine anti-VAR4, -VAR5, and -VAR6 antibodies. In this example 
30 it is shown that antibodies raised against recombinant VAR4, -VAR5, and -VAR6 inhibit 
field parasite Isolate adhesion to endothelial cells in vitro. 

An obvious consequence of this finding is that vaccine induced antibodies against SEQ ID 
NO.: 2, 4, and/or 6 constructs can hinder binding of parasites to endotheiial tissue and 
35 thus prevent severe malaria. 

Example 20: 

A novel OynaBead based method and adhesion to endothelial cells selects for RBC infected 
with parasites having a VSAsm phenotype and which transcribes var genes with similar 
40 characteristics 

Selection of parasites using a novel antibody-DynaBead based method {Example 3) and 
adhesion to endothelial cells (Example 6) leads to changes in the serological phenotype of 
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the 3D7 parasites from berng of a VSAum" to a VSAsM-type (Example 3 and Figs. SC-D). 
This VSAsM phenotype resembles that of parasites causing severe malaria (Example 1 and 
Fig. 1}. Thus, this method and results provided by it forms the basis for identification of 
PfEMPl molecules that could be used as part of a vaccine against severe malaria. 

5 

Using this method we were able to identify three different var genes PFD1235w/MAL7Pl.l 
(SEQ ID NO.: 1), PF11J0008 (SEQ ID NO.: 3), and PF13_0003 (SEQ ID NO.: 5) that were 
highly transcribed and upregutated in 3D7sM(Flgs. 13, 14, and 20, Example 5 and 6, Tabie 
2). The protein products of these three genes are responsible for the serological phenotype 

10 and adhesive properties of 3D7sm» Interestingly, these three genes do not belong to the 
van, var2 gene subfamily as defined previously (Salanti et al, 2002, Salanti et al. 2003) or 
Group B, Group C, and Group B/C var genes as defined recently (Lavstsen et al. 2003). 
They lacic 1-2 cysteine residues in DBLa homology group G (Smith et aL 2000) compared 
to most PfEMPl molecules In Group B, C, and B/C. Additionally, they posses a CIDRia 

15 distinctly different from the CIDRla domain of the MCvarl PfEMPl gene product expressed 
by Malayan Camp parasites and of the A4VAR expressed by A4 parasites that have been 
shown to bind CD36 (Baruch et al. 1997; Cooke et aL 1998; Bryan et aL 2003; Smith et 
al. 1998). By contrast, 3D7sMthat transcribes PFD1235w/MAL7Pl.l (SEQ ID NO.: 1), 
PF11_0008 (SEQ ID NO.: 3), and/or PF13_0003 (SEQ ID NO.: 5) do not bind to CD36 

20 (Example 3). 

In conclusion, the three genes are present and expressed in field isolates causing severe 
malaria, but not in parasites with a VSAum phenotype (Example 17, 18, and 19) indicating 
that the serve similar functions and might he responsible for the pathogenesis of severe 
25 malaria. 
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Claims 

1. An isolated polypeptide comprising an annino acid sequence selected from the group 
consisting of at least one of 

5 SEQ ID NO.: 2, SEQ ID NO.: 4 and SEQ ID NO.: 6 

for use as a medicament. 

2. An isolated polypeptide according to claim 1, wherein said ammo acid sequence has at 
10 least 80% sequence identity to SEQ ID NO.: 2, SEQ IN NO.: 4 and SEQ ID NO.: 6. 

3. An isolated polypeptide according to claim 1 or 2, wherein said amino add sequence Is a 
sub-sequence of with a minimum length of 10 amino acids. 

15 4. A polypeptide according to claim 1, wherein said polypeptide comprises the amino add 
sequence shown In SEQ ID NO:2. 

5. A polypeptide according to claim 4, wherein said polypeptide consists of the amino acid 
sequence shown m SEQ ID NO: 2. 

20 

6. A polypeptide according to claim 1, wherein said polypeptide comprises the amino acid 
sequence shown in SEQ ID NO:4. 

7. A polypeptide according to claim 6, wherein said polypeptide consists of the amino acid 
25 sequence shown in SEQ ID NO:4. 

8. A polypeptide according to claim 1, wherein said polypeptide comprises the amino add 
sequence shown in SEQ ID NO: 6. 

30 9- A polypeptide according to claim 8, wherein said polypeptide consists of the amino add 
sequence shown in SEQ ID NO; 6. 

10. A polypeptide according to daim 1, wherein said amino acid sequence has at least 
80% sequence identity to SEQ ID N0:2. 

35 

11. A polypeptide according to claim 1, wherein said amino acid sequence has at least 
80% sequence identity to SEQ ID NO:4. 
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12. A polypeptide according to claim 1, wherein said amino acid sequence has at least 
80% sequence identity to SEQ ID NO: 6, 

13. An polypeptide to claim 1-12, wherein said amino acid is consistently up-regulated 
5 after antibody selection-induced change from VSAum to VSAsm expression, 

14. An polypeptide according to claim 1-13, wherein said amino add sequence is capable 
of mediating cyto-adhesion of Intact erythrocyte infected by a parasite to human 
endothelial ceils, but not to the CD36 receptor. 

10 

15. An isolated nucleic acid comprising a nucleotide sequence selected from the group 
consisting of at least one of 

a) SEQ ID NO.: 1, SEQ ID NO.: 3 and SEQ ID NO.: 5 

15 

for use as a medicament. 

16. A nucleic acid according to claim 15, wherein said nucleotide sequence has at least 
80% sequence identity to SEQ ID NO.: 1, SEQ ID NO.: 3 or SEQ ID NO.: 5. 

20 

17. A nucleic acid according to claim 15-16, wherein said nucleotide sequence is a sub- 
sequence of with a minimum length of 30 nucleotides. 

18. A nucleic add according to daim 15, wherein said nucleic add comprises the nucleotide 
25 sequence shown in SEQ ID NO: 1. 

19. A nucleic acid according to claim 18, wherein said nudeic acid consists of the 
nucleotide sequence shown in SEQ ID NO:l. 

30 20. A nudeic add according to daim 15, wherein said nudeic add comprises the nudeotide 
sequence shown in SEQ ID NO: 3. 

21. A nucleic acid according to claim 20, wherein said nucleic acid consists of the 
nudeotide sequence shown In SEQ ID NO:3. 

35 

22. A nudeic add according to daim 15, wherein said nudeic acid comprises the nudeotide 
sequence shown in SEQ ID NO: 5. 
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23. A nucleic acid according to claim 22, wherein said nucleic acid consists of the 
nucleotide sequence shown in SEQ ID NO:5. 

24. A nucleic acid according to claim 15, wherein said nucleotide sequence has at least 
5 80% sequence identity to SEQ ID NO:l. 

25. A nucleic acid according to claim 15, wherein said nucleotide sequence has at least 
80% sequence identity to SEQ ID NO: 3. 

10 26. A nucleic acid according to claim 15, wherein said nucleotide sequence has at least 
80% sequence identity to SEQ ID NO: 5. 

27. A nucleic acid sequence according to claim 15-26, wherein said sequence is 
consistently upregulated after antibody selection-induced change from VSAum to VSAsm 

15 expression. 

28. A nucleic acid according to claim 15-17, wherein said nucleic acid sequence encodes a 
polypeptide which is capable of mediating cyto-adhesion of intact erythrocyte infected 
by a parasite to human endothelial cells, but not the CD36 receptor. 

20 

29. A recombinant vector comprising the nucleic acid defined in any of claims 15-28 
operably linked to one or more control sequences for use as a medicament 

30. A composition comprising a polypeptide according to any of claims 1-14 or a nucleic 
25 acid according to any of claims 15-28 and a pharmaceutically acceptable diluent, carrier or 

adjuvant. 

31. A composition according to claim 30, wherein said composition is an immunogenic 
composition. 

30 

32. A composition according to claim 31, wherein said composition Induces an IgG/IgM 
antibody response. 

33. An isolated antibody or isolated antiserum induced in response to one or more 
35 polypeptides as defined in any of claims 1-14 and/or to one or more nucleic acids as 

defined in any of claims 15-28. 
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34. An antibody according to claim 33, wherein said antibody is capable of binding to a 
molecule expressed on tlie surface of an intact eryttirocyte infected by a parasite causing 
malaria. 

5 35. An antibody according to claim 33, wherein said antibody is capable of recognising 
parasites selected in vitro for expression of VSAsw 

36. An antibody according to claim 33, wherein said antibody is capable of binding to a 
molecule expressed on the surface of an intact erythrocyte infected by a parasite capable 
10 of mediating cyto-adhesion of intact erythrocyte infected by a parasite to human 
endothelial cells, but not the CD36 receptor. 

36. A vaccine comprising at least one nucleic acid according to any of claims 15-28 or at 
least one vector according to claim 29, the vaccine effecting in vivo expression of at least 

15 one antigen by a subject, to whom the vaccine has been administered, the amount of 
expressed antigen being effective to confer substantially increased resistance to malaria 
caused by Piasmodlum falciparum, 

37. Use of a polypeptide according to any of claims 1-14 for the manufacture of a 

20 composition to be administered in order to prophylactically or therapeutically reduce the 
incidence, prevalence or severity of malaria in a subject. 

38. Use of a polypeptide according to any of claims 1-14 for the manufacture of a vaccine 
for malaria prophylaxis. 

25 

39. Use of a polypeptide according to any of claims 1-12 for the manufacture of a 
composition for vaccination against malaria. 

40. Use of a nucleic acid according to any of claims 15-28 for the manufacture of an 

30 composition to be administered in order to prophylactically or therapeutically reduce the 
incidence, prevalence or severity of malaria in a subject. 

41. Use of a nucleic acid according to any of claims 1-28 for the manufacture of a vaccine 
for malaria prophylaxis. 

35 

42. Use of a nucleic acid according to any of claims 15-28 for the manufacture of a 
composition for vaccination against malaria. 
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43. Use of a recombinant vector according to claim 29 for the manufacture of a 
composition to be administered in order to prophylacticaily or therapeutically reduce the 
incidence, prevalence or severity of malaria in a subject. 

5 44. Use of a recombinant vector according to claim 29 for the manufacture of a vaccine for 
prophylactic treatment of severe malaria. 

45. Use of a recombinant vector according to claim 29 for the manufacture of a 
composition for vaccination against severe malaria. 



10 



46. Use according to any of claims 37-45, wherein said malaria is caused by Plasmodium 
falciparum. 

47. A method for prophylacticaily or therapeutically reduce the incidence, prevalence or 
15 severity of malaria in an subject said method comprising administering to said subject an 

effective amount of a polypeptide according to any of claims 1-14, a nucleic add according 
to any of claims 15-28 or a recombinant vector according to claim 29. 

48. A method for the prophylactic treatment of severe malaria in an subject, said method 
20 comprising administering to said subject an effective amount of a polypeptide according to 

any of claims 1-14, a nucleic acid according to any of claims 15-28 or a recombinant 
vector according to claim 29. 

49. A vaccination method against severe malaria in an subject, said vaccination method 
25 comprising administering to said subject an effective amount of a polypeptide according to 

any of claims 1-14, a nucleic acid according to any of claims 15-28 or a recombinant 
vector according to claim 29. 

50. A vaccine comprising any of the polypeptides according to any of claims 1-14, the 
30 nucleic acids according to any of claims 15-28 or the recombinant vector according to 

claim 29, said vaccine characterised in that it induces an immune response, wherein said 
immune response specifically recognises a molecule expressed on the suri^ace of an Intact 
erythrocyte infected by a parasites. 

35 51. A vaccine comprising one or more B-cell and/or T-cell epitopes originating from any of 
the polypeptides according to any of claims 1-14, the nucleic acids according to any of 
claims 15-28 or the recombinant vector according to claim 29, said vaccine characterised 
In that It induces an immune response, wherein said immune response specifically 
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recognises a molecule expressed on the surface of an intact erythrocyte infected by a 
parasites. 

52. A DNA vaccine, wliich results in the expression of a polypeptide comprising one or 
5 more B-cell and/or T cell epitopes from any of the polypeptide sequences according to 
claim 1-14, wherein said vaccine is capable of inducing an immune response, wherein said 
immune response specifically recognises a molecule expressed on the surface of an intact 
erythrocyte infected by parasites. 

10 53. A DNA vaccine comprising at least one nucleic acid sequences according 15-28, 
wherein said vaccine is capable of inducing an immune response, wherein said immune 
response specifically recognises a molecule expressed on the surface of an intact 
erythrocyte infected by parasites. 

15 54. An in vitro diagnostic method, said method comprising contacting a sample with a 
polypeptide according to any of claims 1-14 under conditions allowing an in vitro 
immunological reaction to occur between said polypeptide and the antibodies possibly 
present in said sample, and in vitro detect the antigen-antibody complexes possibly 
formed. 

20 

55. An fn vitro diagnostic method according to claim 54, wherein a disease-state profile for 
a tested subject is generated by determining the concentration or expression level in a 
sample of sequences as defined in any of claims 1-14 and/or 15-28. 

25 56. An in vitro diagnostic k\t comprising 

a) a sequence as defined in any of claims 1-14 and/or 15-28 

b) reagents for preparing a suitable medium for carrying out an 
immunological reaction between an antibody present in a sample of body 
fluid or tissue and said sequence; and 

30 c) reagents allowing the detection of the antigen-antibody complexes formed, 

wherein said reagents may bear a radioactive or non-radioactive label, 

57. A method for generating a vaccine against severe malaria comprising 

35 a) injecting a sequence according to any of claims 1-14 in a subject 

b) enabling said subject to generate antibodies specifically recognising any of the 
polypeptide sequences according to claim 1-14 

c) purify said antibodies 

d) selecting antibodies having cross-reactivity to parasites causing severe malaria 
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e) selecting antibodies having tlie ability to iniiibit adhesion to endothelial cells. 

58. A method for testing an inhibitor-molecule capable of inhibiting binding of any of the 
polypeptides according to claim 1-14 to a receptor expressed on endothelia cells 
5 comprising 

a) in vitro cultures of endothelial cells 

b) add potential inhibiting-molecule 

c) add RBC infected with parasites, said iRBC expressing any of said 
polypeptide sequences on their surface of the RBC 

10 d) measure the binding of the iRCB with said endothelia cells by microscopy 

or other means of quantifying binding as for instance liquid scintillation 
spectrometry. 
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Abstract 

The present invention relates to nucleic acid molecules related to the 
PFD123SW/MAL7P1.1, PF11_0008, and PF13_0003 gene families as well as amino acid 
sequences encoded by such nucleic acid molecules with respect to their role in mediating 
5 adhesion of infected red blood cells to endothelial cells, which is characteristic for the 
pathogenesis of severe malaria (SM). Accordingly, the invention provides pharmaceutical 
compositions and vaccines, hereunder nucleotide-based vaccines comprising compounds 
that are related to VAR4, VAR5, and/or VAR6 polypeptides and PFD1235w/MAL7Pl.l 
PFlljOOOS, and/or PF13_0003 nucleic acid molecules. The invention further relates to the 
10 use of these compounds as medicaments and for the manufacture of compositions, such as 
immunogenic compositions. In addition, the Invention relates to methods of treatment and 
prevention of severe malaria wherein these methods are based on the nucleic acid 
molecules and polypeptides of the invention. As these compounds can also be used as 
biotechnologicai tools the invention provides in vitro diagnostic methods and kits 
15 comprising reagents and IgGs/antibodies designated to the use in such methods. The 
invention also relates to methods of Identifying agents capable of modulating the VAR4, 
VAR5, and/or VAR6 dependent adhesion to endothelial cells and agent capable of 
interacting with VAR4, VAR5, and/or VAR6. Finally, a method for identifying polypeptides, 
which will induce a specific IgG/ antibody response upon administration to a subject is 
20 provided by the invention. 
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180 ' 200 • 22 0 240 _ 
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♦ 260 : 280 r 300 • _320 
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KCkMTPSSHKV PKHG Y IMSF5AFFDLW\'KKLLI D3 T NWKN e LTNCl NNThTVTDCKNDCNTNCKCFElW 

580 ^ 600 * 620 * 6^0 



lYKNENGNTlWYYKKLNllFkGfFFHVt-SKEeNKEaKWnKLMElIJCEKIDSSNLKNGTKDSEGAIKVLFDHLKDIAERCIDN 

660 * eeo • 700 ^ 720 



3V'AOIAE:n>:00?yiOKLLGTRGG£SNLKGDATRGTVivLGG<}GN~LNGDICKIT 
SVKQIi^HhIOQ:<AQKI'I>GTRGG£SHrKGDATRGTYls'LGGQGNTLN^ 
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<110> Kobenhavns Universitet 



10 



5 



Louise J0rgensen 
Pamela Magistrado 
Thomas Lavstsen 
Ali Salanti 
Morten A. Nielsen 
Trine Staalso 
Lars Hviid 
Thor Theander 
Anja Jensen 



<120> COMPOUNDS USEFUL IN THE DIAGNOSIS AND 
15 TREATMENT OF MALARIA 

<130> P36017DK01 

<160> 6 

20 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 10662 
25 <212> DNA 

<213> Plasmodium falciparum 

<220> 

<221> gene 
30 <222> (0) . . - (0) 

<223> var gene PFD1235w 

<400> 1 

atggggaatg catcatcatc agagggggag gctaaaaccc ctagtttaac agaaagtcac 60 

35 aacagtgcaa gaaatatttt ggaaggttat gccgaaagta taaaggaaca ggcatcaaaa 120 
gatgcaaaaa tacatggaca tcatttgaaa ggagatttgg cgaaagcagt atttcgtcat 180 
ccattttctg catatagacc taactatgga aatccgtgcg aacttgatta taggtttcat 240 
actaatgtat ggcatcgtaa cgcagaagat agaaatcctt gtctttttag tcgtgcaaaa 300 
cgtttttcaa atgaaggtga agcagaatgt aatggtggta taataactgg taataaaggt 360 

40 gaatgtgggg catgtgcacc gtataggaga agacatatat gtgactataa tttgcaccat 420 
ataaacgaaa ataatataag gaatactcat gatttattgg ggaatttgtt agttatggca 480 
aggagtgaag gtgaatctat tgtgaaaagt catgaatata caggttatgg tatatacaaa 540 
tcaggtatat gtacttctct tgctcgcagt tttgcagata taggagataf tattcgagga 600 
aaagatctgt atcgtcgtga tagtagaaca gataaattag aagagaattt aagaaaaatt 660 

45 ttcgcgaata tatataaaga attgaagaat gggaagaagt gggcggaagc aaaagagtac 720 
taccaagatg atggaactgg aaattattat aaattaaggg aagcttggtg ggcacttaac 780 
agaaaagatg tgtggaaagc attaacatgt agtgcgccaa gggatgctca atatttcata 840 
aaatcaagcg tcagggatca aacattttca aatgattatt gtggccatgg tgaacatgag 900 
gttcttacaa atttagatta tgtccctcaa tttttacgat ggtttgaaga atgggcagaa 960 

50 gagttttgta gaataaaaaa aataaaatta ggaaaggtta aggaagcatg tcgtgatgac 1020 
tcaaaaaaat tatattgtag tcataatgga tatgactgta cgaaaactat tcgaaataaa 1080 
gatattttgt ctgataatcc taaatgtact gggtgttctg ttaaatgcaa agtttatgaa 1140 
ctttggttaa ggaatcaacg aaatgaattt gaaaaacaaa aaaaaaaata ttataaggaa 1200 
atacagacat atacatcgaa ggacgctaaa actgatagta atattaataa cgaatattat 1260 

55 aaggaatttt atgacaaact taaaaatgag ggctatgaaa cattgaacaa atttataaaa 1320 
ttactaaatg aaggaaggta ttgtaaagaa aaaatatcag gagaaaggaa tattgatttt 1380 
actatgactg gtgataaaga cgcgttttat cgctcagact attgccaaat atgtcctgaa 1440 
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tgtggagtcc aatgtagcgg tacaacatgc acaccaaaaa aagtgataca tccgaattgc 1500 
aaagataaag aaacttatga gcctggtgat gcaaaaacca ctgatattac tgtccttUat 1560 
agtggtgatg aagaaggtga tattgcacaa aaattacaag atttttgtaa tgataaaaat 1620 
aaagaaaaug afcgaaaacta tgaaaaatgg caatgctatt ataaaagtag tgagattaat 1680 
5 aaatgtcaaa tgacaccatc atcacacaaa gttccaaaac atggttacat tatgtcattt 1740 
tatgcttttt ttgatttgtg ggttaagaat ttattaatag atagtataaa ttggaagaac 1800 
gatcttacga attgtataaa taatactaat gttacggatt gtaaaaatga ttgtaacaca 1860 
aattgtaaat gttttgaaaa ttgggctaaa acaaaagaaa atgagtggaa aaaagtgaag 1920 
acgatataca aaaatgaaaa cggaaacacg aacaattatt ataaaaaact taataaccat 1980 

10 tttcaaggtt atttttttca cgttatgaaa gagcttaaca aagaagaaaa atggtataaa 2040 
cttatggaag atttaaaaga aaaaattgat tcttccaatt tgaaaaatgg tacgaaagat 2100 
tcagaaggcg caataaaagt gttgttcgat cacttaaaag atatagctga aagatgcata 2160 
gacaataatt caaaagattc atgtccacct tcagtggata cgaaaacaaa cccctgtgct 2220 
aaacctcctg gtagtaaacc cactaaaagt gtaaaacaat tagcggaaca tatgcaacag 2280 

15 aaggcacaaa aacttttggg tactcgtggt ggtgaaagta aattgaaggg ggatgcaaca 2340 
agagggacgt ataaccttgg aggtcaagga aacacgttga atggcgatat atgtaaaata 2400 
acaaaaaatc ataccaatga tagtcgtcct aatggtgaac catgtacagg taaagataaa 2460 
gttaaaaacg ggtttcgctt gaaaatagga accccgtgga caaatattgt acaaaaaaaa 2520 
aaaaaaaagt catacaaaga cttctattta cctcctcggc gtcaacatat gtgtacatca 2580 

20 aatttagaaa atttgagcac gagtagcaaa ggacttagta atggtagttt tgctagtcac 2640 
tcattattag gtgatgtatt gctcgcagca aaatttgaag cacaaaagat aatactagtg 2700 
tataaaaata agaataatat aaatatcaga aaaagaataa ctgacccaaa tgatcaagct 2760 
actgtatgtc gtgctatacg ttacagtttc gccgatctag gagatattat acgaggaaaa 2820 
gacatgtgga atataaacag tgatgcaaaa gatcttcaag atcgtttaga aaaaatafctt 2880 

25 aaaaccatta atgaaaaact tcctaatgaa atccaaaaaa gatatacgaa ccgtgaaaat 294 0 
aaacatttag atttacgttc agactggtgg gaagctaata gacatcaagt ttggagagct 3000 
atgaaatgtg caacaaaagg catcagcaat aacaattgta atggtatccc aatagaagat 3060 
tacatcccac aacgattaag atggatgacg gaatgggccg aatggtattg caaaaagcag 3120 
tcacaggaat atgagaagtt ggaggagaag tgtgggatgt gtacgggtaa gggtcagggt 3180 

30 gatggtaaag attgtacaca gaaggataaa gaatgtagtc cgtgcaagaa agcatgtgat 3240 
gcatataaga aggaaataga aaaatgggaa aaacaatgga aaacagtatc agctatatac 3300 
caaatattat acgcaaaagc acgaattgtt gctagtaatg gcggtcctgg gtattataat 3360 
acggaagtac agaagaaaga ccgatccgtg tatgacttct tgtacgagtt acatttacaa 3420 
aatggtggca aaaaaggtcc tcctcctgct acacatcctt ataaatctgt taacacacgt 3480 

35 gataaacgtg atgccactga tgatacaaca cccactgtgt atagtactgc tgcaggatat 354 0 
gtacaccaag aagcacatat tggtgattgt aaggaacaac acgttttttg tgataataac 3600 
ggcaacaagg agaagtatgc ttttaagaat ccaccaaatg tatatgttga ggcgtgtaag 3660 
tgtatgacga gggaggcacc accaccacca acaactcctt ctactccaaa tccgtgtgct 3720 
gaaactggtg gtgtacatac cattaaaact gtgactgatg tcgcaaaaat attacagggg 3780 

40 gaggcaaatg aaacaatgct aaaaaatagt tccaatggta atgataagga tgagagtaaa 3840 
ttgaaaggta aggcagaaga aggggattat agtcgtggag gtacgccaag tgacttcaac 3900 
aacaatttat gtggtataac acaaaagcat tccaatgctc ataatgattc acaacaacca 3960 
tgttatggaa aagatcaaaa aaggttcaat gtaggaacgg aatggtcatt taaggataat 4020 
catagaaaac ggacacaccc tgaggcatat atgcctccaa gaagggaaca tatatgtaca 4080 

45 tcaaatttgg aatatttaat tcataagaga aaaaaaccaa ttatagaagg tgatcctaac 4140 
aagattattc attccttatt gggcgatgtg ttacttgcag caaaatatga agcagaaaac 4200 
ataaagaaac tgtatgaaga aaataacaac cgaaaagatc aggaaggtat atgtcgagct 4260 
atgaaatata gttttgcaga tataggggat attattcgag gaaaagatat gtggatagaa 4320 
aacaatgatg ctaagagatt acaaacaaat ttgaaagaaa tatttactaa aattaaagaa 4380 

50 aaaactggag gcaccacata taatgaagat aacgatccgt atttaaaatt acgtgcagat 44 40 
tggtgggaag ctaatagagc caaagtatgg aaagcaatga aatgtaaaac aaatggcgta 4500 
gatatcactt gtgatagtga tcatacacca ttggatgatt atatccccca aagattaaga 4560 
tggatgactg aatgggcaga atggtattgt aaagcgcaat cacaggagta taagaagttg 4 620 
gaggagaagt gtagtcaatg caagagtaag ggtaaaggtg ggaatgagtg ttacagagaa 4680 

55 acgaaggaat gtaacgattg caagcaagca tgtgaagaat ataaaaggaa aataaaaaca 4740 
tgggcagatc aatggaaggt aatatcaaat aaatacgagg atttatacaa aaaagcccaa 4800 
aatcctacta atgctgttct caaagataac aaagatgaga aagacaagaa tgtgattgat 4860 
tttttgacac aattacaaaa agcaaataat ggcgaaaaaa ctggtgtgca caccgtgtat 4 920 
tctactgctg caggatatat acatcaagag gcacgtacac gtgaatgcca ggaacaaagg 4 980 
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gagttttgtg 
catgggUatg 
aaggaagaaa 
gataaaacta 
5 tatcctggat 
cctccaagaa 
ccagaagaat 
caaaaataca 
atcGctgatg 
10 ttagatactg 
aatatagatt 
tggggaaaaa 
acaaacaaaa 
gccataaaat 
15 gaatggagtg 
tgtaacgatt 
gatgataaag 
aaaaattgga 
gaactatata 
20 gcacaattaa 
gaaacaaccg 
gaacccgaag 
ctagcacttc 
aatagaggac 
25 ccacaagctg 
aatctaccac 
gatgacgacg 
ggacacgtcg 
gccacagaag 
30 atttgcgcca 
ctcaaatatg 
aaaacagaca 
agtgatagtg 
aagatagtag 
35 aatgggtcac 
ggtactgagt 
gtggctctcc 
aaatataaag 
gttcaacgta 
40 gattttaagc 
aagactatga 
aaaataccaa 
ccaaatagtg 
ggaatgatat 
45 caggttaaag 
gaaactgtgt 
aaaaattt tg 
tgtcgaaaac 
tgtagtggtg 
50 catttcgatt 
aaaaaaacag 
aatggaagta 
acgtgcacta 
aatggaggca 
55 tgtaaacctt 
caaaaggatt 
gtagatatgc 
cagggtgcag 
ggtatagata 



ataaaaaaaa 
ctacagcgtg 
atgtagagag 
caggtggaat 
ggaattgtac 
gaataaaatt 
tgagaaaagc 
aaaaggataa 
actttaagcg 
atatatcatc 
ctgttttcca 
acgcagaagc 
atatggatta 
ctgaactgga 
acgaattttg 
gtactgttag 
ataaatgtga 
aaactcaata 
aaagtatcga 
aaaaactttg 
gacagcctga 
aagtgaatgg 
caccaccagc 
gatcggaacg 
cacaaccacc 
cagctgacag 
aagtccagga 
agacagagga 
tcacaaaaca 
cagtggccaa 
gcaaaaacaa 
caagtgagaa 
aaaaaggttc 
attgggcgga 
aggaagtagt 
cacaggcgag 
tccacgcctt 
tggataaaga 
aaacaagcga 
gtcaaatgtt 
ttgaggcgtt 
aaattttaga 
gtaaaacacc 
gtgctttaac 
ctacggacaa 
catttggtgc 
tggtacgccc 
aaaaacataa 



atggcttgcg 
gtcccagttg 
aatatgagaa 
atggtgatgg 
aagcaacaaa 
ctgacataaa 
gttctgaatt 
gcccaaataa 
gtgttagtga 
gtatatttaa 
tatgtactct 



tggcattgac 
tgattgcata 
tgcgtgcaaa 
agatcattgt 
tccaggtcag 
atgtgtaatt 
ttttattcaa 
gaatggtggt 
tcaaatgttc 
aaaagcaga t 
aaaaattgac 
tatttgggat 
caatgcacac 
agactttgtg 
tacagaacgt 
tgagagtggt 
cgagtgcaaa 
taaaacacaa 
tgacgtcgcc 
tggtaatgct 
taactcccac 
aaagtgtaat 
accatcggga 
tggtgaccaa 
acaaccaaaa 
aaataccaat 
ggaggaggaa 
ggagacgaag 
ggggtcggca 
agcacttaag 
ctcacgttta 
tggtgcccca 
catatgtgtg 
atcacagtcg 
tagtgttaat 
tgatgtgtca 
tgtgaagtcc 
aatagaggaa 
gaacccccaa 
ctatacttta 
agaaaagagt 
tggtgagaac 
acaagaatgg 
atacaacaca 
cacagatctt 
tagtggtacc 
cacatatttt 
gttatatata 
ttgtgacgaa 
tgccagacat 
acaagaaagc 
aggtaataat 
ctttttagaa 
atttagtaat 
taaagtaaat 
tacgattact 
taacactgaa 
aggfcattaga 
ggaaaaaacc 



aacacaagtt 
aataggtcgc 
atagtggaag 
aatccaaagt 
tttaaatcag 
aatttacaat 
tgcgctgcaa 
gttgcacaag 
tatacgtttg 
acaagtacag 
ataactaatg 
ggaatgttat 
acaaaattaa 
aacagacctc 
agtataaaga 
actagtgatg 
agagcatgta 
agcaaaaaat 
agttctacac 
gattgcaagt 
gattcccata 
tgtaaagtga 
cctccagctg 
ggcccactac 
ccaaaacgca 



ctctccgatt 
acgccaccgt 
ccggtgaagg 
ccaacggcaa 
ggcgacaaaa 
ggttggaaat 
cgtcgtgctc 
ccgccgcgaa 
aagacagtaa 
ggagctagtg 
caaggtaacg 
gctgcaatag 
aaggaaaaac 
aagaaattag 
ggagattata 
ggtgacacga 
aacaaagctg 
tggaaagaaa 
gacagtaatg 
ttccaaaaac 
ggcgccaaaa 
cgt tggttag 
attaaaaaag 
aaagttccag 
tgtagatctt 
gcatatagta 
aatgataaag 
tcattaaaag 
acaaatataa 
tgtgaaaatg 
tcacaaaata 
agtggatttg 
aaagatgaat 
aataatggga 



atgcttttaa 
aaacagagga 
aagttctttc 
attatccaag 
gtcatgctgg 
atttgaatga 
tagaaacgta 
caaaattaaa 
gagattatag 
gtgtaggtaa 
tcgaacaacg 
gtgctttaag 
atcccacgta 
aattccttcg 
tcaaggagtt 
ctacgggtaa 
caacatataa 
attttgatga 
aagcctatca 
gtatggatgg 
tgcctgcatc 
aacatcgtcc 
aagaccaaat 
cagcgcgacc 
ctggagaagg 
ccgaagaaga 
cggaggcgga 
aaaagacgga 
caacaccaac 
gtctcaatgc 
gtataccaac 
gtagtgccca 
gacgacgatt 
caagtgttaa 
agagtggtgg 
gcgcgtcgac 
agacgttttt 
aggcagcaca 
aaggtggtga 
gggatatttt 
aaatagaaga 
ctggtggtgg 
acgcaaaaca 
gaaaggacaa 
tgaaaaaaga 
gcaacgacga 
aggaatgggg 
attgtcgtga 
ataagaaaga 
atagaaaatg 
aacaaaaaag 
aattttacac 
gacaatgtat 
catttggatc 
gtagttgtgg 
ttaaaggtct 
aaggtgatt t 
ggaaatgtgg 
aagaaagcga 



ggatccacct 
gccgaagaaa 
gaaaccacga 
aaaagaaaat 
agcatgtatg 
gaagaaatca 
ttggttatgg 
tagtggtacc 
agatttatgt 
agtaaaaatt 
taaaccttgg 
ttataatact 
cggctacaac 
atggttcact 
ggaaacaaaa 
taaaacatgt 
aacttggctt 
taaaagaaaa 
atat ttacat 
tgagtccaaa 
attagatgat 
acaacctccg 
tgagcatgac 
tcctcctccc 
cctcggtcgt 
agacgacgaa 
ggaaggtgaa 
aggggcgggg 
agtagaagat 
cgcatgtgcc 
tagtggtgac 
tggtggtaaa 
atatataaaa 
tggagatggt 
tagtggtagt 
atcgccacaa 
tgcttggcat 
aaatcatcta 
aatacctgaa 
agtgggggac 
tatatcggaa 
ccccaaacaa 
catttggcat 
aaaaatacaa 
caacgactac 
taccaaatta 
agaagagttt 
taataagttt 
tatttttaag 
gatagaaaga 
taattacgta 
aaaactagaa 
tggtaataat 
tgcagaagat 
gagtgctaag 
tactgaccaa 
aggcatttgt 
tgatttctgt 
taaaaaatat 



5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
57 60 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 



36017DK01 



4/31 

atcataatga aagaattcgt taaaagatgg ctagaatatt tttttgaaga ttataataga 8580 
attcaaaaaa aattaaagac atgtaaagaa aatggtaaag gatccacatg tataagaagt 8640 
tgtgtagatg aatggataaa gctgaaaaag gatgaatggc aaaaaattaa cagtaattac 8700 
cttgaccaaa atacaaaaga aaatcctgaa ggtaataatt taagctcttt tttggaggat 8760 
5 ggaccgttta agaatgaggt tgataaagct ataaaacctt gtggtaattt aactgatttc 8820 
aagaagtcaa agaaatgtaa tggcacttcc agatcaggaa atagtgaaga gagcacaaaa 8880 
tatgatggtg ttatatgttt gcttgataat cttaaaaaca taataaaaac ttgtcaaaac 8940 
gtacctagtg gcaaaccaga tacaccgtgt caaaaatccc ccgcccccgt tggagacgat 9000 
gatgatcccc ttgaagagga aaacccagta acacaaccga acatttgtcc gcaaacatca 9060 

10 gtggaagaaa aaaaaaaaga ggaagaagaa aagtgtgatg aaaaggagga agaagaagaa 9120 
aaagaggagg aaaaagataa aggagatgag gaagtaaaag aagaagaaaa agataaagga 9180 
gatgaggaag aagaagcaga agaagaagaa gaagaagaag aggaaacaga tagtcacatt 924 0 
tatgaagact actctgattc agacgcagag gaagatgatg aagatgaagc tgttacagaa 9300 
tccttatcac cttcagagtc acaaccaaaa cgattgctac gagaatttcc atcccccgaa 9360 

15 ttaaaaaatg ccatgttatt ttctaccatc ctctggatgg taggtatcgg ttttgcggcg 9420 
ttcacttatt tttttctaaa gaaaaaaccg aaatcacctg ttgacctctt acgtgtactt 9480 
gacatccata aaggcgatta tggaacacct accccgaaat caaaaaatag atatatcccc 9540 
tatgtgagtg atacatataa agggaaaaca tacttatatg ttgaaggaga tacagacgaa 9600 
gagaaatata tgtttctgtc tgatactact gatataacct cttccgaaag tgagtatgaa 9660 

20 gaattggata ttaatgatat atatgtacca ggtagtccta aatataaaac attgatagaa 9720 
gtagtattgg aaccatcaaa aagtgatggt aacacaccag gtaagggtga tggtaacaca 9780 
ctaggtgatg atatggtacc taccacgaat acatttacag atgaggaatg gaatgaactg 9840 
aaacaggatt ttgtatcaca atatatacaa agtagattac caatggatgt accacaatat 9900 
gatgtatcaa cggagagtcc aatgaatata ggaggtaatg ttttagatga tggfcatggat 9960 

25 gaaaaacctt ttattacttc tattcatgat agggatttaa atagtggaga agaaattagt 10020 
-tataatattc atatgagtac taacactaat aatgatattc caaaatatgt atcaaataat 10080 
gtatattctg gtatagattt aattaatgat acattaagtg ataacaaaca tattgatata 10140 
tatgatgaag tgctaaaaag aaaagaaaat gaattatttg gaacaaatta taagaaaaat 10200 
acatcaaaca atagtgtagc aaaaaatact aatagtgatc caattatgaa ccaattagat 10260 

30 ttgttacata aatggttaga tagacataga gatatatgtg aaaattgggg gaaaaaagaa 10320 
gatattttga ataaattgaa tgaacaatgg aataaagata atgatggtgg tgatatacca 10380 
aatgataaca aaaagttgaa tacggatgtt tcgatacaaa tagatatgga tgaaactaaa 10440 
ggaaagaagg aatttagtaa tatggatact atcttggatg atatggaaga tgatatatat 10500 
tatgatgtaa atgatgaaaa cccatctgta gatgatatac ctatggatca taataaagta 10560 

35 gatgtaccaa agaaagtaca tgttgaaatg aaaatcctta ataatacatc taatggatcg 10620 
ttggaacaac aatttcctat atcggatgta tggaatatat aa 10662 
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Gly lie lie Thr Gly Asn Lys Gly Glu Cys Gly Ala Cys Ala Pro Tyr 

115 120 125 

Arg Arg Arg His lie Cys Asp Tyr Asn Leu His His He Asn Glu Asn 
130 135 140 

5 Asn He Arg Asn Thr His Asp Leu Leu Gly Asn Leu Leu Val Met Ala 
145 150 155 160 

Arg Ser Glu Gly Glu Ser He Val Lys Ser His Glu Tyr Thr Gly Tyr 

165 170 1*75 

Gly He Tyr Lys Ser Gly He Cys Thr Ser Leu Ala Arg Ser Phe Ala 
10 180 185 190 

Asp He Gly Asp He He Arg Gly Lys Asp Leu Tyr Arg Arg Asp Ser 

195 200 205 

Arg Thr Asp Lys Leu Glu Glu Asn Leu Arg Lys He Phe Ala Asn He 
210 215 220 

15 Tyr Lys Glu Leu Lys Asn Gly Lys Lys Trp Ala Glu Ala Lys Glu Tyr 
225 230 235 240 

Tyr Gin Asp Asp Gly Thr Gly Asn Tyr Tyr Lys Leu Arg Glu Ala Trp 

245 250 255 

Trp Ala Leu Asn Arg Lys Asp Val Trp Lys Ala Leu Thr Cys Ser Ala 
20 260 265 270 

Pro Arg Asp Ala Gin Tyr Phe He Lys Ser Ser Val Arg Asp Gin Thr 

275 280 285 

Phe Ser Asn Asp Tyr Cys Gly His Gly Glu His Glu Val Leu Thr Asn 
290 295 300 

25 Leu Asp Tyr Val Pro Gin Phe Leu Arg Trp Phe Glu Glu Trp Ala Glu 
305 310 315 320 

Glu Phe Cys Arg He Lys Lys He Lys Leu Gly Lys Val Lys Glu Ala 

325 330 335 

Cys Arg Asp Asp Ser Lys Lys Leu Tyr Cys Ser His Asn Gly Tyr Asp 
30 340 345 350 

Cys Thr Lys Thr He Arg Asn Lys Asp He Leu Ser Asp Asn Pro Lys 

355 360 365 

Cys Thr Gly Cys Ser Val Lys Cys Lys Val Tyr Glu Leu Trp Leu Arg 
370 375 380 

35 Asn Gin Arg Asn Glu Phe Glu Lys Gin Lys Lys Lys Tyr Tyr Lys Glu 
385 390 395 400 

He Gin Thr Tyr Thr Ser Lys Asp Ala Lys Thr Asp Ser Asn He Asn 

405 410 415 

Asn Glu Tyr Tyr Lys Glu Phe Tyr Asp Lys Leu Lys Asn Glu Gly Tyr 
40 420 425 430 

Glu Thr Leu Asn Lys Phe He Lys Leu Leu Asn Glu Gly Arg Tyr Cys 

435 440 445 

Lys Glu Lys He Ser Gly Glu Arg Asn He Asp Phe Thr Met Thr Gly 
450 455 460 

45 Asp Lys Asp Ala Phe Tyr Arg Ser Asp Tyr Cys Gin He Cys Pro Glu 
465 470 475 480 

Cys Gly Val Gin Cys Ser Gly Thr Thr Cys Thr Pro Lys Lys Val He 

485 490 495 

His Pro Asn Cys Lys Asp Lys Glu Thr Tyr Glu Pro Gly Asp Ala Lys 
50 500 505 510 

Thr Thr Asp He Thr Val Leu Tyr Ser Gly Asp Glu Glu Gly Asp He 

515 520 525 

Ala Gin Lys Leu Gin Asp Phe Cys Asn Asp Lys Asn Lys Glu Asn Asp 
530 535 540 

55 Glu Asn Tyr Glu Lys Trp Gin Cys Tyr Tyr Lys Ser Ser Glu He Asn 
545 550 555 560 

Lys Cys Gin Met Thr Pro Ser Ser His Lys Val Pro Lys His Gly Tyr 

565 570 575 

He Met Ser Phe Tyr Ala Phe Phe Asp Leu Trp Val Lys Asn Leu Leu 
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580 585 590 

lie Asp Ser He Asn Trp Lys Asn Asp Leu Thr Asn Cys He Asn Asn 

595 600 605 

Thr Asn Val Thr Asp Cys Lys Asn Asp Cys Asn Thr Asn Cys Lys Cys 
5 610 615 620 

Phe Glu Asn Trp Ala Lys Thr Lys Glu Asn Glu Trp Lys Lys Val Lys 
625 630 635 640 

Thr He Tyr Lys Asn Glu Asn Gly Asn Thr Asn Asn Tyr Tyr Lys Lys 
645 650 655 

10 Leu Asn Asn His Phe Gin Gly Tyr Phe Phe His Val Met Lys Glu Leu 
660 665 670 

Asn Lys Glu Glu Lys Trp Tyr Lys Leu Met Glu Asp Leu Lys Glu Lys 

675 680 685 

He Asp Ser Ser Asn Leu Lys Asn Gly Thr Lys Asp Ser Glu Gly Ala 
15 690 695 700 

He Lys Val Leu Phe Asp His Leu Lys Asp He Ala Glu Arg Cys He 
705 710 715 720 

Asp Asn Asn Ser Lys Asp Ser Cys Pro Pro Ser Val Asp Thr Lys Thr 
725 730 735 

20 Asn Pro Cys Ala Lys Pro Pro Gly Ser Lys Pro Thr Lys Ser Val Lys 
740 745 750 

Gin Leu Ala Glu His Met Gin Gin Lys Ala Gin Lys Leu Leu Gly Thr 

755 760 765 

Arg Gly Gly Glu Ser Lys Leu Lys Gly Asp Ala Thr Arg Gly Thr Tyr 
25 770 775 780 

Asn Leu Gly Gly Gin Gly Asn Thr Leu Asn Gly Asp He Cys Lys He 
785 790 795 800 

Thr Lys Asn His Thr Asn Asp Ser Arg Pro Asn Gly Glu Pro Cys Thr 
805 810 815 

30 Gly Lys Asp Lys Val Lys Asn Gly Phe Arg Leu Lys He Gly Thr Pro 
820 825 830 

Trp Thr Asn He Val Gin Lys Lys Lys Lys Lys Ser Tyr Lys Asp Phe 

835 840 845 

Tyr Leu Pro Pro Arg Arg Gin His Met Cys Thr Ser Asn Leu Glu Asn 
35 850 B55 860 

Leu Ser Thr Ser Ser Lys Gly Leu Ser Asn Gly Ser Phe Ala Ser His 
865 870 875 880 

Ser Leu Leu Gly Asp Val Leu Leu Ala Ala Lys Phe Glu Ala Gin Lys 
885 890 895 

40 He He Leu Val Tyr Lys Asn Lys Asn Asn He Asn He Arg Lys Arg 
900 905 910 

He Thr Asp Pro Asn Asp Gin Ala Thr Val Cys Arg Ala He Arg Tyr 

915 920 925 

Ser Phe Ala Asp Leu Gly Asp He He Arg Gly Lys Asp Met Trp Asn 
45 930 935 940 

He Asn Ser Asp Ala Lys Asp Leu Gin Asp Arg Leu Glu Lys He Phe 
945 950 955 960 

Lys Thr He Asn Glu Lys Leu Pro Asn Glu He Gin Lys Arg Tyr Thr 
965 970 975 

50 Asn Arg Glu Asn Lys His Leu Asp Leu Arg Ser Asp Trp Trp Glu Ala 
980 985 990 

Asn Arg His Gin Val Trp Arg Ala Met Lys Cys Ala Thr Lys Gly He 

995 1000 1005 

Ser Asn Asn Asn Cys Asn Gly He Pro He Glu Asp Tyr He Pro Gin 
55 1010 1015 1020 

Arg Leu Arg Trp Met Thr Glu Trp Ala Glu Trp Tyr Cys Lys Lys Gin 
1025 1030 1035 1040 

Ser Gin Glu Tyr Glu Lys Leu Glu Glu Lys Cys Gly Met Cys Thr Gly 
1045 1050 1055 
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Lys Gly Gin Gly Asp Gly Lys Asp Cys Thr Gin Lys Asp Lys Glu Cys 

1060 1065 1070 

Ser Pro Cys Lys Lys Ala Cys Asp Ala Tyr Lys Lys Glu He Glu Lys 
1075 1080 1085 

5 Trp Glu Lys Gin Trp Lys Thr Val Ser Ala He Tyr Gin He Leu Tyr 
1090 1095 1100 

Ala Lys Ala Arg He Val Ala Ser Asn Gly Gly Pro Gly Tyr Tyr Asn 
1105 1110 1115 1120 

Thr Glu Val Gin Lys Lys Asp Arg Ser Val Tyr Asp Phe Leu Tyr Glu 
10 1125 1130 1135 

Leu His Leu Gin Asn Gly Gly Lys Lys Gly Pro Pro Pro Ala Thr His 

1140 1145 1150 

Pro Tyr Lys Ser Val Asn Thr Arg Asp Lys Arg Asp Ala Thr Asp Asp 
1155 1160 1165 

15 Thr Thr Pro Thr Val Tyr Ser Thr Ala Ala Gly Tyr Val His Gin Glu 
1170 1175 1180 

Ala His He Gly Asp Cys Lys Glu Gin His Val Phe Cys Asp Asn Asn 
1185 1190 1195 1200 

Gly Asn Lys Glu Lys Tyr Ala Phe Lys Asn Pro Pro Asn Val Tyr Val 
20 1205 1210 1215 

Glu Ala Cys Lys Cys Met Thr Arg Glu Ala Pro Pro Pro Pro Thr Thr 

1220 1225 1230 

Pro Ser Thr Pro Asn Pro Cys Ala Glu Thr Gly Gly Val His Thr He 
1235 1240 1245 

25 Lys Thr Val Thr Asp Val Ala Lys He Leu Gin Gly Glu Ala Asn Glu 
1250 ^ 1255 1260 

Thr Met Leu Lys Asn Ser Ser Asn Gly Asn Asp Lys Asp Glu Ser Lys 
1265 1270 1275 1280 

Leu Lys Gly Lys Ala Glu Glu Gly Asp Tyr Ser Arg Gly Gly Thr Pro 
30 1285 1290 1295 

Ser Asp Phe Asn Asn Asn Leu Cys Gly He Thr Gin Lys His Ser Asn 

1300 1305 1310 

Ala His Asn Asp Ser Gin Gin Pro Cys Tyr Gly Lys Asp Gin Lys Arg 
1315 1320 1325 

35 Phe Asn Val Gly Thr Glu Trp Ser Phe Lys Asp Asn His Arg Lys Arg 
1330 1335 1340 

Thr His Pro Glu Ala Tyr Met Pro Pro Arg Arg Glu His He Cys Thr 
1345 1350 1355 1360 

Ser Asn Leu Glu Tyr Leu He His Lys Arg Lys Lys Pro He He Glu 
40 1365 1370 1375 

Gly Asp Pro Asn Lys He He His Ser Leu Leu Gly Asp Val Leu Leu 

1380 1385 1390 

Ala Ala Lys Tyr Glu Ala Glu Asn He Lys Lys Leu Tyr Glu Glu Asn 
1395 1400 1405 

45 Asn Asn Arg Lys Asp Gin Glu Gly He Cys Arg Ala Met Lys Tyr Ser 
1410 1415 1420 

Phe Ala Asp He Gly Asp He He Arg Gly Lys Asp Met Trp He Glu 
1425 1430 1435 1440 

Asn Asn Asp Ala Lys Arg Leu Gin Thr Asn Leu Lys Glu He Phe Thr 
50 1445 1450 1455 

Lys He Lys Glu Lys Thr Gly . Gly Thr Thr Tyr Asn Glu Asp Asn Asp 

1460 1465 1470 

Pro Tyr Leu Lys Leu Arg Ala Asp Trp Trp Glu Ala Asn Arg Ala Lys 
1475 1480 1485 

55 Val Trp Lys Ala Met Lys Cys Lys Thr Asn Gly Val Asp He Thr Cys 
1490 1495 1500 

Asp Ser Asp His Thr Pro Leu Asp Asp Tyr He Pro Gin Arg Leu Arg 
1505 1510 1515 1520 

Trp Met Thr Glu Trp Ala Glu Trp Tyr Cys Lys Ala Gin Ser Gin Glu 
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1525 1530 1535 

Tyr Lys Lys Leu Glu Glu Lys Cys Ser Gin Cys Lys Ser Lys Gly Lys 

1540 1545 1550 

Gly Gly Asn Glu Cys Tyr Arg Glu Thr Lys Glu Cys Asn Asp Cys Lys 
5 1555 1560 1565 

Gin Ala Cys Glu Glu Tyr Lys Arg Lys lie Lys Thr Trp Ala Asp Gin 

1570 1575 1580 

Trp Lys Val lie Ser Asn Lys Tyr Glu Asp Leu Tyr Lys Lys Ala Gin 
1585 1590 1595 1600 

10 Asn Pro Thr Asn Ala Val Leu Lys Asp Asn Lys Asp Glu Lys Asp Lys 

1505 1610 1615 

Asn Val He Asp Phe Leu Thr Gin Leu Gin Lys Ala Asn Asn Gly Glu 

1620 1625 1630 

Lys Thr Gly Val His Thr Val Tyr Ser Thr Ala Ala Gly Tyr He His 
15 1635 1640 1645 

Gin Glu Ala Arg Thr Arg Glu Cys Gin Glu Gin Arg Glu Phe Cys Asp 

1650 1655 1660 

Lys Lys Asn Gly He Asp Asn Thr Ser Tyr Ala Phe Lys Asp Pro Pro 
1665 1670 1675 1680 

20 His Gly Tyr Ala Thr Ala Cys Asp Cys He Asn Arg Ser Gin Thr Glu 

1685 1690 1695 

Glu Pro Lys Lys Lys Glu Glu Asn Val Glu Ser Ala Cys Lys He Val 

1700 1705 1710 

Glu Glu Val Leu Ser Lys Pro Arg Asp Lys Thr Thr Gly Gly He Asp 
25 1715 1720 1725 

His Cys Asn Pro Lys Tyr Tyr Pro Arg Lys Glu Asn Tyr Pro Gly Trp 

1730 1735 1740 

Asn Cys Thr Pro Gly Gin Phe Lys Ser Gly His Ala Gly Ala Cys Met 
1745 1750 1755 1760 

30 Pro Pro Arg Arg He Lys Leu Cys Val He Asn Leu Gin Tyr Leu Asn 

1765 1770 1775 

Glu Lys Lys Ser Pro Glu Glu Leu Arg Lys Ala Phe He Gin Cys Ala 

1780 1785 1790 

Ala He Glu Thr Tyr Trp Leu Trp Gin Lys Tyr Lys Lys Asp Lys Asn 
35 1795 1800 1805 

Gly Gly Val Ala Gin Ala Lys Leu Asn Ser Gly Thr He Pro Asp Asp 

1810 1815 1820 

Phe Lys Arg Gin Met Phe Tyr Thr Phe Gly Asp Tyr Arg Asp Leu Cys 
1825 1830 1835 1840 

40 Leu Asp Thr Asp He Ser Ser Lys Ala Asp Thr Ser Thr Gly Val Gly 

1845 1850 1855 

Lys Val Lys He Asn He Asp Ser Val Phe Gin Lys He Asp He Thr 

1860 1865 1870' 

Asn Val Glu Gin Arg Lys Pro Trp Trp Gly Lys Asn Ala Glu Ala He 
45 1875 1880 1885 

Trp Asp Gly Met Leu Cys Ala Leu Ser Tyr Asn Thr Thr Asn Lys Asn 

1890 1895 1900 

Met Asp Tyr Asn Ala His Thr Lys Leu Asn Pro Thr Tyr Gly Tyr Asn 
1905 1910 1915 1920 

50 Ala He Lys Ser Glu Leu Glu Asp Phe Val Asn Arg Pro Gin Phe Leu 

1925 1930 1935 

Arg Trp Phe Thr Glu Trp Ser Asp Glu Phe Cys Thr Glu Arg Ser He 

1940 1945 1950 

Lys He Lys Glu Leu Glu Thr Lys Cys Asn Asp Cys Thr Val Ser Glu 
55 1955 1960 1965 

Ser Gly Thr Ser Asp Ala Thr Gly Asn Lys Thr Cys Asp Asp Lys Asp 

1970 1975 1980 

Lys Cys Asp Glu Cys Lys Arg Ala Cys Thr Thr Tyr Lys Thr Trp Leu 
1985 1990 1995 2000 
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Lys Asn Trp Lys Thr Gin Tyr Lys Thr Gin Ser Lys Lys Tyr Phe Asp 

2005 2010 2015 

Asp Lys Arg Lys Glu Leu Tyr Lys Ser lie Asp Asp Val Ala Ser Ser 
2020 2025 2030 

5 Thr Gin Ala Tyr Gin Tyr Leu His Ala Gin Leu Lys Lys Leu Cys Gly 
2035 2040 2045 

Asn Ala Asp Cys Lys Cys Met Asp Gly Glu Ser Lys Glu Thr Thr Gly 

2050 2055 2060 

Gin Pro Asp Asn Ser His Asp Ser His Met Pro Ala Ser Leu Asp Asp 
10 2065 2070 2075 2080 

Glu Pro Glu Glu Val Asn Gly Lys Cys Asn Cys Lys Val Lys His Arg 

2085 2090 2095 

Pro Gin Pro Pro Leu Ala Leu Pro Pro Pro Ala Pro Ser Gly Pro Pro 
2100 2105 2110 

15 Ala Glu Asp Gin He Glu His Asp Asn Arg Gly Arg Ser Glu Arg Gly 
2115 2120 2125 

Asp Gin Gly Pro Leu Pro Ala Arg Pro Pro Pro Pro Pro Gin Ala Ala 

2130 2135 2140 

Gin Pro Pro Gin Pro Lys Pro Lys Arg Thr Gly Glu Gly Leu Gly Arg 
20 2145 2150 2155 2160 

Asn Leu Pro Pro Ala Asp Arg Asn Thr Asn Leu Ser Asp Ser Glu Glu 

2165 2170 2175 

Glu Asp Asp Glu Asp Asp Asp Glu Val Gin Glu Glu Glu Glu Thr Pro 
2180 2185 2190 

25 Pro Ser Glu Ala Glu Glu Gly Glu Gly His Val Glu Thr Glu Glu Glu 
2195 2200 2205 

Thr Lys Pro Val Lys Glu Lys Thr Glu Gly Ala Gly Ala Thr Glu Val 

2210 2215 2220 

Thr Lys Gin Gly Ser Ala Pro Thr Ala Thr Thr Pro Thr Val Glu Asp 
30 2225 2230 2235 2240 

He Cys Ala Thr Val Ala Lys Ala Leu Lys Gly Asp Lys Ser Leu Asn 

2245 2250 2255 

Ala Ala Cys Ala Leu Lys Tyr Gly Lys Asn Asn Ser Arg Leu Gly Trp 
2260 2265 2270 

35 Lys Cys He Pro Thr Ser Gly Asp Lys Thr Asp Thr Ser Glu Asn Gly 
2275 2280 2285 

Ala Pro Arg Arg Ala Arg Ser Ala His Gly Gly Lys Ser Asp Ser Glu 

2290 - 2295 2300 

Lys Gly Ser He Cys Val Pro Pro Arg Arg Arg Arg Leu Tyr He Lys 
40 2305 2310 2315 2320 

Lys He Val Asp Trp Ala Glu Ser Gin Ser Lys Thr Val Thr Ser Val 

2325 2330 2335 

Asn Gly Asp Gly Asn Gly Ser Gin Glu Val Val Ser Val Asn Gly Ala 
2340 2345 2350 

45 Ser Glu Ser Gly Gly Ser Gly Ser Gly Thr Glu Ser Gin Ala Ser Asp 
2355 2360 2365 

Val Ser Gin Gly Asn Gly Ala Ser Thr Ser Pro Gin Val Ala Leu Leu 

2370 2375 2380 

His Ala Phe Val Lys Ser Ala Ala He Glu Thr Phe Phe Ala Trp His 
50 2385 2390 2395 2400 

Lys Tyr Lys Val Asp Lys Glu He Glu Glu Lys Glu Lys Gin Ala Ala 

2405 2410 2415 

Gin Asn His Leu Val Gin Arg Lys Thr Ser Glu Asn Pro Gin Lys Lys 
2420 2425 2430 

55 Leu Glu Gly Gly Glu He Pro Glu Asp Phe Lys Arg Gin Met Phe Tyr 
2435 2440 2445 

Thr Leu Gly Asp Tyr Arg Asp He Leu Val Gly Asp Lys Thr Met He 

2450 2455 2460 

Glu Ala Leu Glu Lys Ser Gly Asp Thr Lys He Glu Asp He Ser Glu 
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2465 2470 2475 2480 

Lys He Pro Lys He Leu Asp Gly Glu Asn Asn Lys Ala Ala Gly Gly 

2485 2490 2495 

Gly Pro Lys Gin Pro Asn Ser Gly Lys Thr Pro Gin Glu Trp Trp Lys 
5 2500 2505 2510 

Glu Asn Ala Lys His He Trp His Gly Met He Cys Ala Leu Thr Tyr 

2515 2520 2525 

Asn Thr Asp Ser Asn Gly Lys Asp Lys Lys He Gin Gin Val Lys Ala 
2530 2535 2540 

10 Thr Asp Asn Thr Asp Leu Phe Gin Lys Leu Lys Lys Asp Asn Asp Tyr 
2545 2550 2555 2560 

Glu Thr Val Ser Phe Gly Ala Ser Gly Thr Gly Ala Lys Ser Asn Asp 

2565 2570 2575 

Asp Thr Lys Leu Lys Asn Phe Val Val Arg Pro Thr Tyr Phe Arg Trp 
15 2580 2585 2590 

Leu Glu Glu Trp Gly Glu Glu Phe Cys Arg Lys Gin Lys His Lys Leu 

2595 2600 2605 

Tyr He He Lys Lys Asp Cys Arg Asp Asn Lys Phe Cys Ser Gly Asp 
2610 2615 2620 

20 Gly Leu Arg Cys Asp Glu Lys Val Pro Asp Lys Lys Asp He Phe Lys 
2625 2630 2635 2640 

His Phe Asp Cys Pro Ser Cys Ala Arg His Cys Arg Ser Tyr Arg Lys 

2645 2650 2655 

Trp He Glu Arg Lys Lys Thr Glu Tyr Glu Lys Gin Glu Ser Ala Tyr 
25 2660 2665 2670 

Ser Lys Gin Lys Ser Asn Tyr Val Asn Gly Ser Asn Gly Asp Gly Gly 

2675 2680 2685 

Asn Asn Asn Asp Lys Glu Phe Tyr Thr Lys Leu Glu Thr Cys Thr Lys 
2690 2695 2700 

30 Ala Thr Asn Phe Leu Glu Ser Leu Lys Gly Gin Cys He Gly Asn Asn 
2705 2710 2715 2720 

Asn Gly Gly Thr Asp He Lys Phe Ser Asn Thr Asn He Thr Phe Gly 

2725 2730 2735 

Ser Ala Glu Asp Cys Lys Pro Cys Ser Glu Phe Lys Val Asn Cys Glu 
35 2740 2745 2750 

Asn Gly Ser Cys Gly Ser Ala Lys Gin Lys Asp Cys Pro Asn Asn Thr 

2755 2760 2765 

He Thr Ser Gin Asn He Lys Gly Leu Thr Asp Gin Val Asp Met Arg 
2770 2775 2780 

40 Val Ser Asp Asn Thr Glu Ser Gly Phe Glu Gly Asp Leu Gly He Cys 
2785 2790 2795 2800 

Gin Gly Ala Gly He Phe Lys Gly He Arg Lys Asp Glu Trp Lys Cys 

2805 2810 2815 

Gly Asp Phe Cys Gly He Asp He Cys Thr Leu Glu Lys Thr Asn Asn 
45 2820 2825 2830 

Gly Lys Glu Ser Asp Lys Lys Tyr He He Met Lys Glu Phe Val Lys 

2835 2840 2845 

Arg Trp Leu Glu Tyr Phe Phe Glu Asp Tyr Asn Arg He Gin Lys Lys 
2850 2855 2860 

50 Leu Lys Thr Cys Lys Glu Asn Gly Lys Gly Ser Thr Cys He Arg Ser 
2865 2870 2875 2880 

Cys Val Asp Glu Trp He Lys Leu Lys Lys Asp Glu Trp Gin Lys He 

2885 2890 2895 

Asn Ser Asn Tyr Leu Asp Gin Asn Thr Lys Glu Asn Pro Glu Gly Asn 
55 2900 2905 2910 

Asn Leu Ser Ser Phe Leu Glu Asp Gly Pro Phe Lys Asn Glu Val Asp 

2915 2920 2925 

Lys Ala He Lys Pro Cys Gly Asn Leu Thr Asp Phe Lys Lys Ser Lys 
2930 2935 2940 
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Lys Cys Asn Gly Thr Ser Arg Ser Gly Asn Ser Glu Glu Ser Thr Lys 
2945 2950 2955 2960 

Tyr Asp Gly Val lie Cys Leu Leu Asp Asn Leu Lys Asn lie lie Lys 
2965 2970 2975 

5 Thr Cys Gin Asn Val Pro Ser Gly Lys Pro Asp Thr Pro Cys Gin Lys 
2980 2985 2990 

Ser Pro Ala Pro Val Gly Asp Asp Asp Asp Pro Leu Glu Glu Glu Asn 

2995 3000 3005 

Pro Val Thr Gin Pro Asn lie Cys Pro Gin Thr Ser Val Glu Glu Lys 
10 3010 3015 3020 

Lys Lys Glu Glu Glu Glu Lys Cys Asp Glu Lys Glu Glu Glu Glu Glu 
3025 3030 3035 3040 

Lys Glu Glu Glu Lys Asp Lys Gly Asp Glu Glu Val Lys Glu Glu Glu 
3045 3050 3055 

15 Lys Asp Lys Gly Asp Glu Glu Glu Glu Ala Glu Glu Glu Glu Glu Glu 
3060 3065 3070 

Glu Glu Glu Thr Asp Ser His He Tyr Glu Asp Tyr Ser Asp Ser Asp 

3075 3080 3085 

Ala Glu Glu Asp Asp Glu Asp Glu Ala Val Thr Glu Ser Leu Ser Pro 
20 3090 3095 3100 

Ser Glu Ser Gin Pro Lys Arg Leu Leu Arg Glu Phe Pro Ser Pro Glu 
3105 3110 3115 3120 

Leu Lys Asn Ala Met Leu Phe Ser Thr He Leu Trp Met Val Gly He 
3125 3130 3135 

25 Gly Phe Ala Ala Phe Thr Tyr Phe Phe Leu Lys Lys Lys Pro Lys Ser 
3140 3145 3150 

Pro Val Asp Leu Leu Arg Val Leu Asp He His Lys Gly Asp Tyr Gly 

3155 3160 3165 

Thr Pro Thr Pro Lys Ser Lys Asn Pirg Tyr He Pro Tyr Val Ser Asp 
30 3170 3175 3180 

Thr Tyr Lys Gly Lys Thr Tyr Leu Tyr Val Glu Gly Asp Thr Asp Glu 
3185 3190 3195 3200 

Glu Lys Tyr Met Phe Leu Ser Asp Thr Thr Asp He Thr Ser Ser Glu 
3205 3210 3215 

35 Ser Glu Tyr Glu Glu Leu Asp He Asn Asp He Tyr Val Pro Gly Ser 
3220 3225 3230 

Pro Lys Tyr Lys Thr Leu He Glu Val Val Leu Glu Pro Ser Lys Ser 

3235 3240 3245 

Asp Gly Asn Thr Pro Gly Lys Gly Asp Gly Asn Thr Leu Gly Asp Asp 
40 3250 3255 3260 

Met Val Pro Thr Thr Asn Thr Phe Thr Asp Glu Glu Trp Asn Glu Leu 
3265 3270 3275 3280 

Lys Gin Asp Phe Val Ser Gin Tyr He Gin Ser Arg Leu Pro Met Asp 
3285 3290 3295 

45 Val Pro Gin Tyr Asp Val Ser Thr Glu Ser Pro Met Asn He Gly Gly 
3300 3305 3310 

Asn Val Leu Asp Asp Gly Met Asp Glu Lys Pro Phe He Thr Ser He 

3315 3320 3325 

His Asp Arg Asp Leu Asn Ser Gly Glu Glu He Ser Tyr Asn He His 
50 3330 3335 3340 

Met Ser Thr Asn Thr Asn Asn Asp He Pro Lys Tyr Val Ser Asn Asn 
3345 3350 3355 3360 

Val Tyr Ser Gly He Asp Leu He Asn Asp Thr Leu Ser Asp Asn Lys 
3365 3370 3375 

55 His He Asp He Tyr Asp Glu Val Leu Lys Arg Lys Glu Asn Glu Leu 
3380 3385 3390 

Phe Gly Thr Asn Tyr Lys Lys Asn Thr Ser Asn Asn Ser Val Ala Lys 

3395 3400 3405 

Asn Thr Asn Ser Asp Pro He Met Asn Gin Leu Asp Leu Leu His Lys 
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3410 3415 3420 

Trp Leu Asp Arg His Arg Asp He Cys Glu Asn Trp Gly Lys Lys Glu 
3425 3430 3435 3440 

Asp He Leu Asn Lys Leu Asn Glu Gin Trp Asn Lys Asp Asn Asp Gly 
5 3445 3450 3455 

Gly Asp He Pro Asn Asp Asn Lys Lys Leu Asn Thr Asp Val Ser He 

3460 3465 3470 

Gin He Asp Met Asp Glu Thr Lys Gly Lys Lys Glu Phe Ser Asn Met 
3475 3480 3485 

10 Asp Thr He Leu Asp Asp Met Glu Asp Asp He Tyr Tyr Asp Val Asn 
3490 3495 3500 

Asp Glu Asn Pro Ser Val Asp Asp He Pro Met Asp His Asn Lys Val 
3505 3510 3515 3520 

Asp Val Pro Lys Lys Val His Val Glu Met Lys He Leu Asn Asn Thr 
15 3525 3530 3535 

Ser Asn Gly Ser Leu Glu Gin Gin Phe Pro He Ser Asp Val Trp Asn 
3540 3545 3550 

He 

20 

<210> 3 
<211> 8985 
<212> DNA 
25 <213> Plasmodium falciparum 

<220> 

<221> gene 

<222> (0) . . . (0) 

30 <223> PF11_0008 

<400> 3 

atggggtcac aaacatcaaa attttctaaa actgttgttg gaaatgaaac acacaacagt 60 
gcccgaaatg ttttggaagg ttttgcaaaa gatataaaaa gggatgtatc aaataacgca 120 

35 aaaagacatg gaaaagtttt gaagggaaat ttgagagatg ccaaatttta tcatgattat 180 
tctaagttaa gagacatacc tagaagtccc tgtgatcttg atttttggtt tcatacgaat 240 
gtttggaggg acaaagcata tgaaagagat ccttgttatg gcagacaagc aaaaaataat 300 
tacaatttgg aaggagcagt atgtacgaat agtaaaataa aaggtaatga aaacaagata 360 
aatgacattg gagcgtgtgc cccatataga agacgaaata tatgcgatta taatttagaa 420 

40 catctaaatg aaagaaatgt tttaaatact catgatttat tgggaaatgt gttagttatg 480 
gcaaaacgtg aaggtgaatc tattgttgag aaacatccaa atagaggatc ttcagaagta 540 
tgtattgccc ttgcaagaag ttttgcagat ataggagata ttttaagagg aaaagacatg 600 
tatgtcgggt atgatgaaaa agaaaaaaat cgaagaaaac aattagaaaa taagttgaaa 660 
gatattttcg ataatatata taaggatttg acgaagaaga aggggaggaa tgggaagaag 720 

45 tcggcgctac aagaacgcta caatgatcct aaaggagatt tttttcaatt acgagaagat 780 
tggtgggcgc ttaatagaga agacgtatgg aaagcattaa catgttctgc ggatgacagt 840 
gaagattatt tcatacaatc agaaggtgtt acaaaatcat ttacaaatcc taaatgtggc 900 
catggtgaca atgaggttct tacaaatctt gattatgtcc ctcaattttt acgttggttc 960 
accgaatggg cagaagagtt ttgtaggata agaaaaatta aattaggaaa ggttaaaaac 1020 

50 gaatgtcgtg gtgagacatc aggtaaaagg tattgtagtg gcgatggtta tgattgtact 1080 
aaaacagata tatcacgtaa tattttttat atggatttag attgcccacg ttgtgaagaa 114 0 
gaatgtagaa aatatgatga atggattgaa aataaagaaa atgaattaga taaacaaaaa 1200 
aataaataca ctaaagaaat tgaaaaatta aaagataatt ctaagagcaa ttatgataaa 1260 
aatttttatt taacacttac taaaaaatat ggttcaatta acttattttt agatacatta 1320 

55 aaagaaggat cacattgtag ctacaatacc atagaagata aaatagattt taacaaagca 1380 
aatcaaacat ttactagttc gaaattttgt ggggcatgtc ctttttatgg agttaaatgc 1440 
aattggaaaa catgtacaga ggttaaggaa aatgagtaca aaaaaaaaaa taaggttgat 1500 
agtacacata cgacagaaca acctactgct attgatgtac tagttactca tattagagga 1560 
acaaatattc ccgaagattt aaaggattgt aaaaaatatg gtctttttaa aggtatgcga 1620 
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aaacaagcgt ggaaatgtca atatataaat ccatatgatg aatgcaagtt gagtcctttt 1680 
gtgaaagata tagatgttga tgatcgtatt ttatttaaag tattgtttga gcgttggtta 1740 
aaatatttca tacaagattt taataatgtt aaggacaaaa ttaatagatg tacaaaattc 1800 
gaaaagggaa aagataatac atgtattaaa ggttgtaaac ataaatgcga atgtgtggaa 1860 
5 aaatggataa aaataaagga agcagaatgg aaaaaaataa atcaacatta taaccaacaa 1920 
aaaaaacatt atacctatag tgttcctcgt tgggttaata gttatttgac gcaccagcac 1980 
ttctctagtg attttattaa tgcattagaa gcttttaaaa atatacgggg attagaaaat 2040 
ttgaaggaat gtagtagtga cacttgtaaa attgaaaaaa ttagaactat agatgatgat 2100 
ttaataaaag aattaatttc taaacttaaa gataaatgtg ctatgtgtaa aaaccaacat 2160 
10 aaagcaacca aaggtaaaga atgttgtggt aaattaccta aaactctaaa tgatcaagat 2220 
gatgaagaag acgaagaata cgaagcacca ccaccaccaa cacctcctcg cacccaaaaa 2280 
aacccctgcg ttaatggtca aaaccagaag gttcgtaaaa tcaggagcgt gagacgagtg 234 0 
ccgaaaagga tgcaaaaaca agcaagtgtt cgtgttcctc gtgctcggca gggaggggag 2400 
agggagcagg ttgtaaagaa cgggcgccag gaccaccacc tccagcgggt gcttttggtg 2460 
15 ggggaggcgg aggcggagga gccggagacg gcggaggaaa agaaggagga ggaaaaggag 2520 
gaagacacgg acgggaaggt acagccacca ccagcagcaa caacacccgg ggtgaagccg 2580 
ccatgtgaca tagttgaaaa acactttaaa gataagcacg ataatactgg tgcaatagat 2640 
cattgtaatc caaaaaagga ttatcctcca tggaaaaacg acaaaagttt agtggatgaa 2700 
gatggtgtgt atatgcctcc tagaagacaa aaattatgtg taattaattt agaacatttt 2760 
20 aaagagaata catcagatga tttgagagaa gcattcatta aatgcgctgc cgcagaaact 2820 
tatttgttat ggcaaaaata taaagaggat aacaatggtg gtgaagatct acaaaaccaa 2880 
ttaaaaagtg gaaaaattcc tgaagatttt aaacgtcaaa tgttctacac atttggagat 2940 
tatagagatt tcttatttgg aacagatata tcaaaattaa ataaacatac agaagctgtt 3000 
aaaactaata tagatagaat tttcccacca actgagcgaa caaatgatac aatacgtaaa 3060 
25 gaattttggg aaaaaaacgc agaatctatt tggcaaggaa tgttatgtgc tttaagttat 3120 
aatagtaatg acaaaaaaat ggatccagat gtacaaaaag aactaaactc cacctacaac 3180 
tacgatacca taaaaaataa tctagaagac tttgcgaaca gacctcaatt cttacgatgg 324 0 
tttattgaat ggagtgatga attttgtcgt gaacggaaga agaaggaaga aaaggtagga 3300 
tcagcttgta agaatgatta tgagggatgt gcaaacacta aggataatgg taatggtaat 3360 
30 tgtgttaacg catgtaatgc atataaaaaa tacattacag acaaaaaaga acaatatgaa 3420 
aaacaagcaa agaaatttga tattgataaa agtcagaata aaccaggata tgaagattat 3480 
tctggcaaaa aggcttctga atatttgaaa gaaaaatgta taaactcatc atgtgattat 3540 
atgctaaaat taaaagataa ttccaattac tgggaaaaac ctcatacaac atacgacgat 3600 
aattcacttc aaaacaaatg tagctgcccc ctttctccct gcgaaatcgt ggataaaaca 3660 
35 ctgggcgaca aaacctcaaa gtcttacgcc gaagggtgca aatggaaata tgggaagatg 3720 
ccactaggtt tgggatggtt atgtaatgac aaagagggtg agaagggtaa ggaggacggt 3780 
ttatgtatac cgccaaggag aaagagatta tatgtaaaag atttagagac attcagcgat 3840 
cacacaacag taggtctccg agaagctttt attaaatgtg ctgctgtaga aacttttttt 3900 
gcttggcatg aatttacaaa ggaaaaagaa agagagtata aagaagaaaa acaaagaaat 3960 
40 ggagaacttg gatttatcga cgaaaacgat caaataccaa aggacccaga taatccacag 4020 
aacaaaataa gaaaaaatgg agaaatacat gaagagttta aaagtcaaat gttctatacc 4 080 
cttgcagatt atagagatat attatttggg aataatatag gtattggtaa cgacatgggg 4140 
aaagttaaaa gtaatataga taaggttttc gcaaatagta gtggcaaaac acctactgcc 4200 
aaaaaaacaa caccaaaaga atggtgggaa aaaaatgcaa aagatatatg ggaaggaatg 4260 
45 ttatgtgctt taagttatga tacgaaaaca aaaattaaga atgaagaact gcgcaagaaa 4320 
cttatagatc caaagaacag caactacatg tacgaaaaag ttacatttag tagtgataat 4380 
aacacaaatt tgtcaaaatt tacagaaaga cccccgtttt ttcgatggtt tcaggaatgg 4440 
ggagaagagt tttgtcgaaa aaaaaaaata aaaattgata aaattgaaaa agaatgtcgt 4500 
ggaccatatg gtcgaaatca ttgtgatggc gatggattcg actgtagtga aataggccca 4 560 
50 aatgagaatg gaagttttgc gatttttaaa tgtccaagtt gtgcaatttc ttgtcgatct 4 620 
tataaaacgt ggataaacac aaaaaaagac gaatttaaaa aacaagaaaa actatataat 4 680 
aaagaaatta aggataataa aagtaattat gataatatat atgataaaga atttgttaaa 4740 
aacctttgta cagattataa gtctgttgac tcatttttaa aaaagttaaa agaaggccca 4800 
tgttgtaaca aaaatactaa agacagtaaa atagatttta aggatacaga ggagacattt 4860 
55 agaaacgctg aatattgcga tccatgtcct gtatttggag ttatttgtaa taacggtgat 4920 
tgcagtaatt ctacagaaaa gaagtgtgat gcacaagaat ttaaggttac atatgatgta 4 980 
aaaaataagg aaaaccctaa taaagaagta aatatgcttg tcagtgataa aacagcaaag 5040 
aaatacccag gtgatttaaa cggtgtttgc gaaaattcaa gtatctttga aggtattaga 5100 
gaagataaat ggtcatgtgg ttatttctgt ggtttagata tatgtacacc caacaaaact 5160 
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acaggtgata 
gaaaattttt 
gaaaacagaa 
atagagatga 
5 gttgctgatt 
gaccttgaaa 
tgtactaata 
ttgcttaata 
ggcaaggaaa 
10 acccacgaca 
gcaaatccat 
gaaatgcacg 
aaggttaagg 
aaacatgaag 
15 accaattatc 
ggtaaagaca 
accattaaag 
ttacttcatg 
tttttggggg 
20 acgagattaa 
gctgatatag 
ctacaaacta 
aaaaaatata 
aatcgagcca 
25 tgtgatacta 
accgaatggg 
cagtgtagga 
acacaagctt 
a tatcaaaaa 
30 actagtagta 
caaaaaaata 
catatgactg 
aagaaatatg 
ccatcaacgc 
35 gaatctgaac 
aaggagacgg 
ccagtaccag 
gaggatgacg 
gaagacgatg 
40 ggtgaaggtg 
caggaggcgg 
ttggatgacc 
atcggttttg 
ttgttgcgtg 
45 aataggtaca 
ggagatagtg 
gaatatgaag 
ttgattgaag 
ggtaacacac 
50 agtgaattga 
aaagttggtg 
ataaacgaaa 
attaaatata 
aatgtatatt 
55 atatatgatg 
aatacatcaa 
gatttgttac 
gaagaattgt 
ccaaatgata 



tacatgataa 
taaaagatca 
atatatgtac 
aaatgaaaga 
cagtagttta 
aagttaaagg 
ctgtatcaac 
aacttaaaaa 
gttgcaaaac 
cacctgacat 
gtggcgacaa 
aagaggcaca 
atagtacggt 
gtaatccaga 
aaaaacgtgg 
caagatttgt 
ttctgttgcc 
ttaataaggg 
atgttttgct 
atggccaaaa 
gtgatattat 
agttagtaac 
gtagtgaaaa 
aagtatggga 
ccactgttac 
ccgaatggtt 
attgtaggag 
gtaatacata 
aatacaaaac 
ctacagatga 
ctgataatac 
attgtcaaaa 
cttttcgtca 
cagtagacgt 
ctgaatccga 
ctaccacaga 
aactaccggg 
aagacgccga 
acgacgacga 
aaggcgacgg 
agggggtggt 
ccctccttaa 
cggcgttgac 
tactgaatat 
taccatatag 
atagtggaca 
aattggatat 
tggtactaga 
taggtgatga 
aacatgattt 
tatcaaagga 
aaccttttat 
atattaatat 
ctggtatcga 
aagtgctaaa 
ataacaatgt 
atacatggtt 
tagataaatt 
acaaaaagtt 



acaaaatgca 
taacaaaatt 
agatgtttgc 
atggaaaata 
cgaagtgaga 
agatgttaat 
agaaaataga 
tgaaatacgt 
attacccgaa 
accaccaggt 
aagcgccacc 
caaggatatg 
tgagagtgtg 
tgacttgaaa 
tggttataat 
cataggaacc 
tccgcgacgt 
cccacttcta 
tgcagcaaaa 
tgacaatgga 
acgaggaaaa 
aatatttggt 
tcccccatat 
agcaatgcaa 
ccctcttgtg 
ctgcaaaatg 
tggaatatgt 
taaacaaaaa 
attataccaa 
gaaagacaaa 
catatatact 
acaaacaatt 
tccaccacat 
ctcccgcaaa 
agaagaagaa 
gacaacacaa 
accacccgca 
aaacgaagac 
agatgacgac 
cggtgacgtc 
accacaacca 
aaccgccctc 
ttacttttta 
cccgaaagga 
aagtggttca 
ctactacgaa 
taatgagata 
accatccaaa 
tatggtacct 
tatatcacaa 
attaccaatg 
tacttctatt 
gggtactaat 
tttaattaat 
aagaaaagaa 
agctaaatta 
agatagacat 
aaatgaacaa 
gaatacggat 



ccaattagag 
aaagacaaaa 
agaaaaaatt 
gtacgcgatc 
agatttttag 
gatttacgtg 
aaatgtagaa 
cattgtaaaa 
cctacagacg 
gacgttgcac 
aatgtggtaa 
ttagagagga 
ttaagggctg 
cacaacatgt 
tatcgaggac 
atatggaaag 
cgtcatatgt 
aaagttgaac 
tatgaagcag 
gctaaatgta 
gatctgtggg 
aaaattaaag 
acgacattac 
tgtccaacaa 
gattacatcc 
cagtcacagg 
gagaatggta 
ataaaaaaat 
caagcaaaag 
gatgtcgttg 
actgctgcag 
ttttgtaaaa 
gatcatgatg 
ctagacaccc 
gacgatgccg 
ccagcagcac 
ccagcgggac 
gatgatgacg 
gacgaagacg 
ggcgaggagg 
gcagcaccac 
atgtcttcta 
ctcaagaaaa 
gattatgaaa 
tataaaggca 
gatacaactg 
tatccgtatc 
agtaatggta 
accacgaata 
tatatacaaa 
aatataggag 
catgataggg 
agtatggacg 
gacacattaa 
aatgaattat 
acaaatagtg 
agagatatgt 
tggaataaag 
gtttctatac 



tactgtttaa 
tttctttatg 
gtgaatgtat 
gttacgtcaa 
agggcttgca 
atttagagga 
aaaaggatgt 
atgaacgtga 
atccacaaac 
ccactttttg 
atgtgacaga 
gtgttaaaaa 
atgcatcaaa 
gcaatataac 
catgtacggg 
atgaagacga 
gtacatcaaa 
ctgataaaat 
aattcataaa 
gagctatgaa 
gaattcagga 
aggaaattcc 
gtgaacattg 
taccaccagt 
cacaaagat t 
agtatgaggt 
aggatgactg 
gggaagatca 
gcagtgttaa 
atttcttgaa 
gatttataca 
acactagtta 
atgcgtgtgc 
aacgtgaccc 
aagaggagga 
cagcgggacc 
cagcagctga 
tcggctccgc 
aagaggactc 
aagatgaaga 
aaccaccaac 
ccatcctctg 
aatccaaatc 
tgcctacgtt 
aaacatatat 
atattacttc 
agtcaccaaa 
acacaccaag 
catttacaga 
gtgaaccact 
gtaatgtttt 
atttatatac 
atccaacata 
gtggtaatca 
ttggaacaaa 
atcctattat 
gtgagaagtg 
ataatgatgg 
aaatagatat 



acgatggata 
tataaataat 
agataaatgg 
acaatataat 
acctcaaaat 
actaagtgaa 
agtagaaagt 
cgatagtatg 
agatagtgat 
taatgttcca 
ggtggcgaag 
ggttgagagt 
aggtgaatat 
gaaggaacat 
taaaggtaat 
aaaagatgaa 
tttagaatat 
taatcattcc 
aaccaattat 
atatagtttt 
cttcaaggafc 
cgatattaaa 
gtgggaagca 
caccacaagt 
acgttggatg 
gttagtgaaa 
tgtcaagtgc 
atggaaggaa 
tggtgctact 
aatgttacac 
tcaagaagca 
taacgacaag 
ttgcaggcca 
caaaaaagag 
ggagccggca 
accggtaaca 
tggccccatt 
caccggcaca 
agcagacgaa 
tcacggcggc 
cccacaactc 
gatggtaggt 
ttctgttgac 
gaaatccaaa 
atatatggaa 
atccgaaagt 
atacaaaaca 
taagggtgat 
tgaggaatgg 
ggatgtacca 
agatgatggt 
tggggaagaa 
tgtatcaaat 
acatattgat 
ttataagaaa 
gaaccaatta 
gaataaaaag 
tggtgatata 
agatgaaaat 



5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
60 60 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
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aaaggaaaga aagaatttag taatatggat acaaacgtgg atacacctac tatggatagt 8760 
atattggatg atttggaaac atataatgaa cctttttatg atatatttga ggatgatgtg 8820 
tattatgatg tatatgatga aaacccattt gtggatgata tacctatgga tcataataaa 8880 
gtagatgtac ctaagaaagt acatattgaa atgaaaatcc ttaataatac atccaatgga 8940 
tcgttggaac aacaatttcc tatatcggat gtatggaata tataa 8985 



<210> 4 
10 <211> 2994 
<212> PRT 

<213> Plasmodium falciparuin 
<400> 4 

15 Met Gly Ser Gin Thr Ser Lys Phe Ser Lys Thr Val Val Gly Asn Glu 
15 10 15 

Thr His Asn Ser Ala Arg Asn Val Leu Glu Gly Phe Ala Lys Asp lie 

20 25 30 

Lys Arg Asp Val Ser Asn Asn Ala Lys Arg His Gly Lys Val Leu Lys 
20 35 40 45 

Gly Asn Leu Arg Asp Ala Lys Phe Tyr His Asp Tyr Ser Lys Leu Arg 

50 55 60 

Asp lie Pro Arg Ser Pro Cys Asp Leu Asp Phe Trp Phe His Thr Asn 
65 70 75 80 

25 Val Trp Arg Asp Lys Ala Tyr Glu Arg Asp Pro Cys Tyr Gly Arg Gin 

85 90 95 

Ala Lys Asn Asn Tyr Asn Leu Glu Gly Ala Val Cys Thr Asn Ser Lys 

100 105 110 

He Lys Gly Asn Glu Asn Lys He Asn Asp lie Gly Ala Cys Ala Pro 
30 115 120 125 

Tyr Arg Arg Arg Asn He Cys Asp Tyr Asn Leu Glu His Leu Asn Glu 

130 135 140 

Arg Asn Val Leu Asn Thr His Asp Leu Leu Gly Asn Val Leu Val Met 
145 150 155 160 

35 Ala Lys Arg Glu Gly Glu Ser He Val Glu Lys His Pro Asn Arg Gly 

165 170 175 

Ser Ser Glu Val Cys He Ala Leu Ala Arg Ser Phe Ala Asp He Gly 

180 185 190 

Asp He Leu Arg Gly Lys Asp Met Tyr Val Gly Tyr Asp Glu Lys Glu 
40 195 200 205 

Lys Asn Arg Arg Lys Gin Leu Glu Asn Lys Leu Lys Asp He Phe Asp 

210 215 220 

Asn He Tyr Lys Asp Leu Thr Lys Lys Lys Gly Arg Asn Gly Lys Lys 
225 230 235 240 

45 Ser Ala Leu Gin Glu Arg Tyr Asn Asp Pro Lys Gly Asp Phe Phe Gin 

245 250 255 

Leu Arg Glu Asp Trp Trp Ala Leu Asn Arg Glu Asp Val Trp Lys Ala 

260 265 270 

Leu Thr Cys Ser Ala Asp Asp Ser Glu Asp Tyr Phe He Gin Ser Glu 
50 275 280 285 

Gly Val Thr Lys Ser Phe Thr Asn Pro Lys Cys Gly His Gly Asp Asn 

290 295 300 

Glu Val Leu Thr Asn Leu Asp Tyr Val Pro Gin Phe Leu Arg Trp Phe 
305 310 315 320 

55 Thr Glu Trp Ala Glu Glu Phe Cys Arg He Arg Lys He Lys Leu Gly 

325 330 335 

Lys Val Lys Asn Glu Cys Arg Gly Glu Thr Ser Gly Lys Arg Tyr Cys 

340 345 350 

Ser Gly Asp Gly Tyr Asp Cys Thr Lys Thr Asp He Ser Arg Asn He 



36017DK01 



16/31 



355 360 365 

Phe Tyr Met Asp Leu Asp Cys Pro Arg Cys Glu Glu Glu Cys Arg Lys 

370 375 380 

Tyr Asp Glu Trp lie Glu Asn Lys Glu Asn Glu Leu Asp Lys Gin Lys 
5 385 390 395 400 

Asn Lys Tyr Thr Lys Glu lie Glu Lys Leu Lys Asp Asn Ser Lys Ser 

405 410 415 

Asn Tyr Asp Lys Asn Phe Tyr Leu Thr Leu Thr Lys Lys Tyr Gly Ser 
420 425 430 

10 lie Asn Leu Phe Leu Asp Thr Leu Lys Glu Gly Ser His Cys Ser Tyr 
435 440 445 

Asn Thr He Glu Asp Lys He Asp Phe Asn Lys Ala Asn Gin Thr Phe 

450 455 460 

Thr Ser Ser Lys Phe Cys Gly Ala Cys Pro Phe Tyr Gly Val Lys Cys 
15 465 470 475 480 

Asn Trp Lys Thr Cys Thr Glu Val Lys Glu Asn Glu Tyr Lys Lys Lys 

485 490 495 

Asn Lys Val Asp Ser Thr His Thr Thr Glu Gin Pro Thr Ala He Asp 
500 505 510 

20 Val Leu Val Thr His He Arg Gly Thr Asn He Pro Glu Asp Leu Lys 
515 520 525 

Asp Cys Lys Lys Tyr Gly Leu Phe Lys Gly Met Arg Lys Gin Ala Trp 

530 535 540 

Lys Cys Gin Tyr He Asn Pro Tyr Asp Glu Cys Lys Leu Ser Pro Phe 
25 545 550 555 560 

Val Lys Asp He Asp Val Asp Asp Arg He Leu Phe Lys Val Leu Phe 

565 570 575 

Glu Arg Trp Leu Lys Tyr Phe He Gin Asp Phe Asn Asn Val Lys Asp 
580 585 590 

30 Lys He Asn Arg Cys Thr Lys Phe Glu Lys Gly Lys Asp Asn Thr Cys 
595 600 605 

He Lys Gly Cys Lys His Lys Cys Glu Cys Val Glu Lys Trp He Lys 

610 615 620 

He Lys Glu Ala Glu Trp Lys Lys He Asn Gin His Tyr Asn Gin Gin 
35 625 630 635 640 

Lys Lys His Tyr Thr Tyr Ser Val Pro Arg Trp Val Asn Ser Tyr Leu 

645 650 655 

Thr His Gin His Phe Ser Ser Asp Phe He Asn Ala Leu Glu Ala Phe 
660 665 670 

40 Lys Asn He Arg Gly Leu Glu Asn Leu Lys Glu Cys Ser Ser Asp Thr 
675 680 685 

Cys Lys He Glu Lys He Arg Thr He Asp Asp Asp Leu He Lys Glu 

690 695 700 

Leu He Ser Lys Leu Lys Asp Lys Cys Ala Met Cys Lys Asn Gin His 
45 705 710 715 720 

Lys Ala Thr Lys Gly Lys Glu Cys Cys Gly Lys Leu Pro Lys Thr Leu 

725 730 735 

Asn Asp Gin Asp Asp Glu Glu Asp Glu Glu Tyr Glu Ala Pro Pro Pro 
740 745 750 

50 Pro Thr Pro Pro Arg Thr Gin Lys Asn Pro Cys Val Asn Gly Gin Asn 
755 760 765 

Gin Lys Val Arg Lys He Arg Ser Val Arg Arg Val Pro Lys Arg Met 

770 775 780 

Gin Lys Gin Ala Ser Val Arg Val Pro Arg Ala Arg Gin Gly Gly Glu 
55 785 790 795 800 

Arg Glu Gin Val Val Lys Asn Gly Arg Gin Asp His His Leu Gin Arg 

805 810 815 

Val Leu Leu Val Gly Glu Ala Glu Ala Glu Glu Pro Glu Thr Ala Glu 
820 825 830 
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Glu Lys Lys Glu Glu Glu Lys Glu Glu Asp Thr Asp Gly Lys Val Gin 

835 840 845 

Pro Pro Pro Ala Ala Thr Thr Pro Gly Val Lys Pro Pro Cys Asp He 
850 855 860 

5 Val Glu Lys His Phe Lys Asp Lys His Asp Asn Thr Gly Ala He Asp 
865 870 875 880 

His Cys Asn Pro Lys Lys Asp Tyr Pro Pro Trp Lys Asn Asp Lys Ser 

885 890 895 

Leu Val Asp Glu Asp Gly Val Tyr Met Pro Pro Arg Arg Gin Lys Leu 
10 900 905 910 

Cys Val He Asn Leu Glu His Phe Lys Glu Asn Thr Ser Asp Asp Leu 

915 920 925 

Arg Glu Ala Phe He Lys Cys Ala Ala Ala Glu Thr Tyr Leu Leu Trp 
930 935 940 

15 Gin Lys Tyr Lys Glu Asp Asn Asn Gly Gly Glu Asp Leu Gin Asn Gin 
945 950 955 960 

Leu Lys Ser Gly Lys He Pro Glu Asp Phe Lys Arg Gin Met Phe Tyr 

965 970 975 

Thr Phe Gly Asp Tyr Arg Asp Phe Leu Phe Gly Thr Asp He Ser Lys 
20 980 985 990 

Leu Asn Lys His Thr Glu Ala Val Lys Thr Asn He Asp Arg He Phe 

995 1000 1005 

Pro Pro Thr Glu Arg Thr Asn Asp Thr He Arg Lys Glu Phe Trp Glu 
1010 1015 1020 

25 Lys Asn Ala Glu Ser He Trp Gin Gly Met Leu Cys Ala Leu Ser Tyr 
1025 1030 1035 1040 

Asn Ser Asn Asp Lys Lys Met Asp Pro Asp Val Gin Lys Glu Leu Asn 

1045 1050 1055 

Ser Thr Tyr Asn Tyr Asp Thr He Lys Asn Asn Leu Glu Asp Phe Ala 
30 1060 1065 1070 

Asn Arg Pro Gin Phe Leu Arg Trp Phe He Glu Trp Ser Asp Glu Phe 

1075 1080 1085 

Cys Arg Glu Arg Lys Lys Lys Glu Glu Lys Val Gly Ser Ala Cys Lys 
1090 1095 1100 

35 Asn Asp Tyr Glu Gly Cys Ala Asn Thr Lys Asp Asn Gly Asn Gly Asn 
1105 1110 1115 1120 

Cys Val Asn Ala Cys Asn Ala Tyr Lys Lys Tyr He Thr Asp Lys Lys 

1125 1130 1135 

Glu Gin Tyr Glu Lys Gin Ala Lys Lys Phe Asp He Asp Lys Ser Gin 
40 1140 1145 1150 

Asn Lys Pro Gly Tyr Glu Asp Tyr Ser Gly Lys Lys Ala Ser Glu Tyr 

1155 1160 1165 

Leu Lys Glu Lys Cys He Asn Ser Ser Cys Asp Tyr Met Leu Lys Leu 
1170 1175 1180 

45 Lys Asp Asn Ser Asn Tyr Trp Glu Lys Pro His Thr Thr Tyr Asp Asp 
1185 1190 1195 1200 

Asn Ser Leu Gin Asn Lys Cys Ser Cys Pro Leu Ser Pro Cys Glu He 

1205 1210 1215 

Val Asp Lys Thr Leu Gly Asp Lys Thr Ser Lys Ser Tyr Ala Glu Gly 
50 1220 1225 1230 

Cys Lys Trp Lys Tyr Gly Lys Met Pro Leu Gly Leu Gly Trp Leu Cys 

1235 1240 1245 

Asn Asp Lys Glu Gly Glu Lys Gly Lys Glu Asp Gly Leu Cys He Pro 
1250 1255 1260 

55 Pro Arg Arg Lys Arg Leu Tyr Val Lys Asp Leu Glu Thr Phe Ser Asp 
1265 1270 1275 1280 

His Thr Thr Val Gly Leu Arg Glu Ala Phe He Lys Cys Ala Ala Val 

1285 1290 1295 

Glu Thr Phe Phe Ala Trp His Glu Phe Thr Lys Glu Lys Glu Arg Glu 
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1300 1305 1310 

Tyr Lys Glu Glu Lys Gin Arg Asn Gly Glu Leu Gly Phe lie Asp Glu 

1315 1320 1325 

Asn Asp Gin lie Pro Lys Asp Pro Asp Asn Pro Gin Asn Lys lie Arg 
5 1330 1335 1340 

Lys Asn Gly Glu He His Glu Glu Phe Lys Ser Gin Met Phe Tyr Thr 
1345 1350 1355 1360 

Leu Ala Asp Tyr Arg Asp He Leu Phe Gly Asn Asn He Gly He Gly 
1365 1370 1375 

10 Asn Asp Met Gly Lys Val Lys Ser Asn He Asp Lys Val Phe Ala Asn 
1380 1385 1390 

Ser Ser Gly Lys Thr Pro Thr Ala Lys Lys Thr Thr Pro Lys Glu Trp 

1395 1400 1405 

Trp Glu Lys Asn Ala Lys Asp He Trp Glu Gly Met Leu Cys Ala Leu 
15 1410 1415 1420 

Ser Tyr Asp Thr Lys Thr Lys He Lys Asn Glu Glu Leu Arg Lys Lys 
1425 1430 1435 1440 

Leu He Asp Pro Lys Asn Ser Asn Tyr Met Tyr Glu Lys Val Thr Phe 
1445 1450 1455 

20 Ser Ser Asp Asn Asn Thr Asn Leu Ser Lys Phe Thr Glu Arg Pro Pro 
1460 1465 1470 

Phe Phe Arg Trp Phe Gin Glu Trp Gly Glu Glu Phe Cys Arg Lys Lys 

1475 1480 1485 

Lys He Lys He Asp Lys He Glu Lys Glu Cys Arg Gly Pro Tyr Gly 
25 1490 1495 1500 

Arg Asn His Cys Asp Gly Asp Gly Phe Asp Cys Ser Glu He Gly Pro 
1505 1510 1515 1520 

Asn Glu Asn Gly Ser Phe Ala He Phe Lys Cys Pro Ser Cys Ala He 
1525 1530 1535 

30 Ser Cys Arg Ser Tyr Lys Thr Trp He Asn Thr Lys Lys Asp Glu Phe 
1540 1545 1550 

Lys Lys Gin Glu Lys Leu Tyr Asn Lys Glu He Lys Asp Asn Lys Ser 

1555 1560 1565 

Asn Tyr Asp Asn He Tyr Asp Lys Glu Phe Val Lys Asn Leu Cys Thr 
35 1570 1575 1580 

Asp Tyr Lys Ser Val Asp Ser Phe Leu Lys Lys Leu Lys Glu Gly Pro 
1585 1590 1595 1600 

Cys Cys Asn Lys Asn Thr Lys Asp Ser Lys He Asp Phe Lys Asp Thr 
1605 1610 1615 

40 Glu Glu Thr Phe Arg Asn Ala Glu Tyr Cys Asp Pro Cys Pro Val Phe 
1620 1625 1630 

Gly Val He Cys Asn Asn Gly Asp Cys Ser Asn Ser Thr Glu Lys Lys 

1635 1640 1645 

Cys Asp Ala Gin Glu Phe Lys Val Thr Tyr Asp Val Lys Asn Lys Glu 
45 1650 1655 1660 

Asn Pro Asn Lys Glu Val Asn Met Leu Val Ser Asp Lys Thr Ala Lys 
1665 1670 1675 1680 

Lys Tyr Pro Gly Asp Leu Asn Gly Val Cys Glu Asn Ser Ser He Phe 
1685 1690 1695 

50 Glu Gly He Arg Glu Asp Lys Trp Ser Cys Gly Tyr Phe Cys Gly Leu 
1700 1705 1710 

Asp He Cys Thr Pro Asn Lys Thr Thr Gly Asp He His Asp Lys Gin 

1715 1720 1725 

Asn Ala Pro He Arg Val Leu Phe Lys Arg Trp He Glu Asn Phe Leu 
55 1730 1735 1740 

Lys Asp His Asn Lys He Lys Asp Lys He Ser Leu Cys He Asn Asn 
1745 1750 1755 1760 

GJu Asn Arg Asn He Cys Thr Asp Val Cys Arg Lys Asn Cys Glu Cys 
1765 1770 1775 
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lie Asp Lys Trp He Glu Met Lys Met hys Glu Trp Lys He Val Arg 

1780 1785 1.790 

Asp Arg Tyr Val Lys Gin Tyr Asn Val Ala Asp Ser Val Val Tyr Glu 
1795 1800 1805 

5 Val Arg Arg Phe Leu Glu Gly Leu Gin Pro Gin Asn Asp Leu Glu Lys 
1810 1815 1820 

Val Lys Gly Asp Val Asn Asp Leu Arg Asp Leu Glu Glu Leu Ser Glu 
1825 1830 1835 1840 

Cys Thr Asn Thr Val Ser Thr Glu Asn Arg Lys Cys Arg Lys Lys Asp 
10 1845 1850 1855 

Val Val Glu Ser Leu Leu Asn Lys Leu Lys Asn Glu He Arg His Cys 

I860 1865 1870 

Lys Asn Glu Arg Asp Asp Ser Met Gly Lys Glu Ser Cys Lys Thr Leu 
1875 1880 1885 

15 Pro Glu Pro Thr Asp Asp Pro Gin Thr Asp Ser Asp Thr His Asp Thr 
1890 1895 1900 

Pro Asp He Pro Pro Gly Asp Val Ala Pro Thr Phe Cys Asn Val Pro 
1905 1910 1915 1920 

Ala Asn Pro Cys Gly Asp Lys Ser Ala Thr Asn Val Val Asn Val Thr 
20 1925 1930 1935 

Glu Val Ala Lys Glu Met His Glu Glu Ala His Lys Asp Met Leu Glu 

1940 1945 1950 

Arg Ser Val Lys Lys Val Glu Ser Lys Val Lys Asp Ser Thr Val Glu 
1955 I960 1965 

25 Ser Val Leu Arg Ala Asp Ala Ser Lys Gly Glu Tyr Lys His Glu Gly 
1970 1975 1980 

Asn Pro Asp Asp Leu Lys His Asn Met Cys Asn He Thr Lys Glu His 
1985 1990 1995 2000 

Thr Asn Tyr Gin Lys Arg Gly Gly Tyr Asn Tyr Arg Gly Pro Cys Thr 
30 2005 2010 2015 

Gly Lys Gly Asn Gly Lys Asp Thr Arg Phe Val He Gly Thr He Trp 

2020 2025 2030 

Lys Asp Glu Asp Glu Lys Asp Glu Thr He Lys Val Leu Leu Pro Pro 
2035 2040 2045 

35 Arg Arg Arg His Met Cys Thr Ser Asn Leu Glu Tyr Leu Leu His Val 
2050 2055 2060 

Asn Lys Gly Pro Leu Leu Lys Val Glu Pro Asp Lys He Asn His Ser 
2065 2070 2075 2080 

Phe Leu Gly Asp Val Leu Leu Ala Ala Lys Tyr Glu Ala Glu Phe He 
40 2085 2090 2095 

Lys Thr Asn Tyr Thr Arg Leu Asn Gly Gin Asn Asp Asn Gly Ala Lys 

2100 2105 2110 

Cys Arg Ala Met Lys Tyr Ser Phe Ala Asp He Gly Asp He He Arg 
2115 2120 2125 

45 Gly Lys Asp Leu Trp Gly He Gin Asp Phe Lys Asp Leu Gin Thr Lys 
2130 2135 2140 

Leu Val Thr He Phe Gly Lys He Lys Glu Glu He Pro Asp He Lys 
2145 2150 2155 2160 

Lys Lys Tyr Ser Ser Glu Asn Pro Pro Tyr Thr Thr Leu Arg Glu His 
50 2165 2170 2175 

Trp Trp Glu Ala Asn Arg Ala Lys Val Trp Glu Ala Met Gin Cys Pro 

2180 2185 2190 

Thr He Pro Pro Val Thr Thr Ser Cys Asp Thr Thr Thr Val Thr Pro 
2195 2200 2205 

55 Leu Val Asp Tyr He Pro Gin Arg Leu Arg Trp Met Thr Glu Trp Ala 
2210 2215 2220 

Glu Trp Phe Cys Lys Met Gin Ser Gin Glu Tyr Glu Val Leu Val Lys 
2225 2230 2235 2240 

Gin Cys Arg Asn Cys Arg Ser Gly He Cys Glu Asn Gly Lys Asp Asp 
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2245 2250 2255 

Cys Val Lys Cys Thr Gin Ala Cys Asn Thr Tyr Lys Gin Lys He Lys 

2260 2265 2270 

Lys Trp Glu Asp Gin Trp Lys Glu He Ser Lys Lys Tyr Lys Thr Leu 
5 2275 2280 2285 

Tyr Gin Gin Ala Lys Gly Ser Val Asn Gly Ala Thr Thr Ser Ser Thr 

2290 2295 2300 

Thr Asp Glu Lys Asp Lys Asp Val Val Asp Phe Leu Lys Met Leu His 
2305 2310 2315 2320 

10 Gin Lys Asn Thr Asp Asn Thr He Tyr Thr Thr Ala Ala Gly Phe He 

2325 2330 2335 

His Gin Glu Ala His Met Thr Asp Cys Gin Lys Gin Thr He Phe Cys 

2340 2345 2350 

Lys Asn Thr Ser Tyr Asn Asp Lys Lys Lys Tyr Ala Phe Arg His Pro 
15 2355 2360 2365 

Pro His Asp His Asp Asp Ala Cys Ala Cys Arg Pro Pro Ser Thr Pro 

2370 2375 2380 

Val Asp Val Ser Arg Lys Leu Asp Thr Gin Arg Asp Pro Lys Lys Glu 
2385 2390 2395 , 2400 

20 Glu Ser Glu Pro Glu Ser Glu Glu Glu Glu Asp Asp Ala Glu Glu Glu 

2405 2410 2415 

Glu Glu Pro Ala Lys Glu Thr Ala Thr Thr Glu Thr Thr Gin Pro Ala 

2420 2425 2430 

Ala Pro Ala Gly Pro Pro Val Thr Pro Val Pro Glu Leu Pro Gly Pro 
25 2435 2440 2445 

Pro Ala Pro Ala Gly Pro Ala Ala Asp Gly Pro He Glu Asp Asp Glu 

2450 2455 2460 

Asp Ala Glu Asn Glu Asp Asp Asp Asp Val Gly Ser Ala Thr Gly Thr 
2465 2470 2475 2480 

30 Glu Asp Asp Asp Asp Asp Glu Asp Asp Asp Asp Glu Asp Glu Glu Asp 

2485 2490 2495 

Ser Ala Asp Glu Gly Glu Gly Glu Gly Asp Gly Gly Asp Val Gly Glu 

2500 2505 2510 

Glu Glu Asp Glu Asp His Gly Gly Gin Glu Ala Glu Gly Val Val Pro 
35 2515 2520 2525 

Gin Pro Ala Ala Pro Gin Pro Pro Thr Pro Gin Leu Leu Asp Asp Pro 

2530 2535 2540 

Leu Leu Lys Thr Ala Leu Met Ser Ser Thr He Leu Trp Met Val Gly 
2545 2550 2555 2560 

40 He Gly Phe Ala Ala Leu Thr Tyr Phe Leu Leu Lys Lys Lys Ser Lys 

2565 2570 2575 

Ser Ser Val Asp Leu Leu Arg Val Leu Asn He Pro Lys Gly Asp Tyr 

2580 2585 2590 

Glu Met Pro Thr Leu Lys Ser Lys Asn Arg Tyr He Pro Tyr Arg Ser 
45 2595 2600 2605 

Gly Ser Tyr Lys Gly Lys Thr Tyr He Tyr Met Glu Gly Asp Ser Asp 

2610 2615 2620 

Ser Gly His Tyr Tyr Glu Asp Thr Thr Asp He Thr Ser Ser Glu Ser 
2625 2630 2635 2640 

50 Glu Tyr Glu Glu Leu Asp He Asn Glu He Tyr Pro Tyr Gin Ser Pro 

2645 2650 2655 

Lys Tyr Lys Thr Leu He Glu Val Val Leu Glu Pro Ser Lys Ser Asn 

2660 2665 2670 

Gly Asn Thr Pro Ser Lys Gly Asp Gly Asn Thr Leu Gly Asp Asp Met 
55 2675 2680 2685 

Val Pro Thr Thr Asn Thr Phe Thr Asp Glu Glu Trp Ser Glu Leu Lys 

2690 2695 2700 

His Asp Phe He Ser Gin Tyr He Gin Ser Glu Pro Leu Asp Val Pro 
2705 2710 2715 2720 
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Lvs Val Gly Val Ser Lys Glu Leu Pro Met Asn lie Gly Gly Asn Val 

^ 2725 2730 2735 

Leu Asp Asp Gly He Asn Glu Lys Pro Phe He Thr Ser He His Asp 
2740 2745 2750 

5 Arq Asp Leu Tyr Thr Gly Glu Glu lie Lys Tyr Asn He Asn Met Gly 
2755 2760 2765 

Thr Asn Ser Met Asp Asp Pro Thr Tyr Val Ser Asn Asn Val Tyr Ser 

2770 2775 2780 

Gly He Asp Leu He Asn Asp Thr Leu Ser Gly Asn Gin His He Asp 
10 2785 2790 2795 2800 

He Tyr Asp Glu Val Leu Lys Arg Lys Glu Asn Glu Leu Phe Gly Thr 

2805 2810 2815 

Asn Tyr Lys Lys Asn Thr Ser Asn Asn Asn Val Ala Lys Leu Thr Asn 
2820 2825 2830 

15 Ser Asp Pro He Met Asn Gin Leu Asp Leu Leu His Thr Trp Leu Asp 
2835 2840 2845 

Arg His Arg Asp Met Cys Glu Lys Trp Asn Lys Lys Glu Glu Leu Leu 

2850 2855 2860 

Asp Lys Leu Asn Glu Gin Trp Asn Lys Asp Asn Asp Gly Gly Asp He 
20 2865 2870 2875 2880 

Pro Asn Asp Asn Lys Lys Leu Asn Thr Asp Val Ser He Gin He Asp 

2885 2890 2895 

He Asp Glu Asn Lys Gly Lys Lys Glu Phe Ser Asn Met Asp Thr Asn 
2900 2905 2910 

25 Val Asp Thr Pro Thr Met Asp Ser He Leu Asp Asp Leu Glu Thr Tyr 
2915 2920 2925 

Asn Glu Pro Phe Tyr Asp He Phe Glu Asp Asp Val Tyr Tyr Asp Val 

2930 2935 2940 

Tyr Asp Glu Asn Pro Phe Val Asp Asp He Pro Met Asp His Asn Lys 
30 2945 2950 2955 2960 

Val Asp Val Pro Lys Lys Val His He Glu Met Lys He Leu Asn Asn 

2965 2970 2975 

Thr Ser Asn Gly Ser Leu Glu Gin Gin Phe Pro He Ser Asp Val Trp 
2980 2985 2990 

35 Asn He 



<210> 5 
40 <211> 10041 
<212> DNA 

<213> Plasmodium falciparum 

<220> 
45 <221> gene 

<222> (0)...(0) 
<:223> PF13_0003 

<400> 5 

50 atggggaata cacaatcatc agaggaagag 

cacaacagtg caaggggtgt tttggaagaa 

aaagagagta aacatgtaag gcaattgaaa 

cgattgtata aggaatctgg tggtgattta 

acatacaaat ttcatactaa tataacaact 

55 ggtagggaaa acaatcgttt ttctgaaagt 

aaaggaaatg aaaataacag taatggtaca 

tgtgatcaaa atttagaatt tttagataat 

ggaaatgtgt tagttacagc aaaatacgaa 

aaaaacagca atggaaataa atcaggaata 



gaagctaaaa gccctagttt aacagaaagt 60 
attggaaaaa agataaaaga taagacagaa 120 
ggaaaattat caaatgcaaa atttgctgat 180 
aggtctgctt attcagatgc ttgttcactt 240 
gatggggggg atggaaggca tccttgtcat 300 
caagaatatg gatgtagtaa tgtatacata 360 
gcatgtgtac caccaagaag aagacatata 420 
cctcacactg atgatactga tgatttgttg 480 
ggtaattata ttgttagtaa tcatccagat 540 
tgtacttctc ttgcacgcag ttttgcagat 600 
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ataggagata 
ctaaaaaagg 
aatgatatat 
caagtatgga 
5 ttagatggaa 
cctacctatt 
ttttgtcgaa 
aataataaac 
gaaattt tta 
10 tgggtagaaa 
gatgctacac 
gaactaaaaa 
atatgtgaac 
tcccgttcag 
15 tgtaaaacca 
aacgaaaatg 
attacgcatg 
caaaaatgga 
agtaatatat 
20 tatttcttag 
caagaaatcg 
aaagaaaaaa 
aaagatgaag 
tatgttaaaa 
25 tgcattgatc 
ttacttaata 
gaaaaacctt 
gacgaaacac 
actgtgagtt 
30 agtgttgttg 
ggcggggtcg 
gacagtcgtg 
ggaacggact 
cctagacgtc 
35 agaaatggta 
gcagaacgaa 
gctgtgcgtt 
aaaaatcatg 
gagcaacttc 
40 aaatataaac 
atgcaatgcc 
gactacatcc 
cagtcacggc 
tgtaagcaag 
45 tttattaata 
gaagaagcaa 
gtccttcaat 
gctgaaggat 
tgtaaaaaaa 
50 tttcgtcctc 
aaggtacgga 
aacaacgatg 
tatccaaaat 
cctagaagac 
55 aataaacaag 
tcatggcagt 
ggaactattc 
tttttatttg 
ataaatattc 



ttgtaagagg 
ttttcgagaa 
ctggaaa tta 
aagccataac 
aaataatttt 
tagattatgt 
taaaaaatat 
attgtagtcg 
tggatctaga 
aaaaaaaaaa 
aaaataatga 
gttcatataa 
atattgataa 
agtattgtaa 
ttaatgaatt 
aaacccctat 
atttaaagaa 
aatgtaaata 
ataatgaacg 
aagattacaa 
tatgtataaa 
gggaagaatg 
atgtttcttc 
atgccttgga 
ctaataaacc 
gacttgaaaa 
gtgttgatat 
cacgcgccca 
atatcgctag 
atggtgataa 
gaaaaaacct 
gtactcctac 
ggcaaggtga 
aacatttttg 
atgctagcaa 
caaagaacca 
acagttttgc 
gcgagaagaa 
ttaatagcag 
aattaagaga 
caccaaaaaa 
cccaaagatt 
tgtatggaga 
gtaatggcca 
catggcaacc 
aaaagtataa 
tcttgaacaa 
atgtacacca 
gaaatggtga 
aaccacacga 
aaaaaaaaaa 
gaaccataag 
gggattgtaa 
aaaaattatg 
ataatttaag 
attataagag 
ctgacgattt 
gaaccgatat 
ttttcccaaa 



aagagatatg 
aataaataat 
ttataaatta 
atgtagagcc 
ttcagataat 
ccctcaattt 
aaaaatagga 
tgagggttac 
atgtccacgt 
agaatttaat 
taataacgaa 
agaagttaat 
aaaaattcca 
atcatgtcct 
taaatgtaga 
tgatattgat 
taattacgaa 
taaatgttac 
atatatctca 
taaattaaag 
tgaatgtaag 
gaaaaaaata 
taaacttaaa 
caaggatgaa 
caaaggaaaa 
acaaatcgat 
acctaaactt 
taatccgtgt 
acaaatgcat 
taagttggaa 
gaatggagat 
agatggacct 
tagtttcgta 
tacatcgaat 
ttcattattg 
ttatgtatct 
cgatttagga 
aacacaagaa 
tatcaaagat 
agattggtgg 
cggtactttc 
aagatggatg 
gttggtggag 
ttgtgtaacg 
tcaatggaaa 
tgatagtagc 
attactcacg 
agaagcacat 
gatcccaagt 
ccatgatgag 
aaaagttgat 
aataggacaa 
ttctcagatt 
cgtatatttt 
aaaagcattt 
taagaatggt 
taaacgtcaa 
atcaaaaggt 



tggtgtccga 



tttaaatcta 
ggattgaaga 
agggaagctt 
ccaaacgggg 
ggaccatgtg 
ttaagatggt 
aatattaaga 
gattgtaata 
tgtgcagatg 
aaacaaaaga 
aatggaatct 
agtttttttg 
atggactata 
ataacagata 
gaaataaaaa 
attctggtta 
aactgtgatc 
ttagatgtat 
attaaagtac 
gaaaaattga 
caaaattgtg 
aaagatcg tt 
aaatttttga 
acgttagata 
ccatgtaaca 
aattgcaaaa 
ctaaatgatg 
gtagataaaa 
cgaagggcaa 
ggcgatatat 
atttgcaaga 
tgtgaaggca 
aacccacaat 
ttagaaaaat 
ggtgatgtgc 
aaaaaggaag 
gatattatac 
aatttagaaa 
aaatataagg 
gaagccaatc 
ccatgtaaaa 
accgaatggg 
acgtgtggta 
tgcaagccag 
caaatggaac 
agaaaagata 
caaaacaaag 
attagtgatt 
agtgatacag 
gtgtgtgagt 
gcatgtgaaa 
tgcaaacgta 
catacaacac 
tttgcaaatc 
attatatctg 
ggtgaaaacc 
atgttctata 
cttggtgaag 
aaaattccta 



atgaaaaggt 
aaataggaat 
ggtggacggc 
ctaattattt 
gtcgtaagga 
taaatgaatg 
aatcctgtac 
aaacaaatct 
attgtaaatc 
aaaaatacga 
ataataaaaa 
aattattgaa 
ataatactga 
ttttatgtga 
gtatgcctaa 
atgttaataa 
tttttaaaaa 
gtgaaccgag 
tatttaaacg 
acccatgcat 
aatgcgtaga 
acgtacaaca 
aacaggaact 
gtatgaaaga 
ataacgatgt 
agaagcacga 
aagatgagga 
atgattctca 
aagcacaaat 
ttaaggttac 
ttgacaaaac 
aaggcgatcg 
accgtgggat 
tagatgttag 
tgctcgcagc 
aacattccga 
gaggaaaaga 
gaatatttgc 
atgatgacaa 
gttcacaggt 
gtgatcatac 
cagaatggta 
agtgtatgca 
catgtgaaaa 
aaaaatactc 
ccacaaacaa 
gaaataaaac 
gtcagaaaca 
agaccgacaa 
gtaatactag 
tggcaaaaac 
aagatgaagg 
ataatggagc 
catctcaaat 
cagcagcaga 
tccaaactca 
catatggaga 
ggactgccct 
atgaaaaaac 



agaaatcggt 
taatgattat 
taaccgcgat 
tagaaaaggt 
actaatcgtt 
gtcggaagag 
tggagaaagt 
aagacttaat 
gtatgaaaca 
aaaagaagtt 
attttatgat 
taaaggacca 
gaaaacattt 
tgataatgaa 
tataagaaaa 
caaaaaggtt 
actaggagaa 
aaatttggat 
gtggttagaa 
gtataatgta 
aaaatggata 
atatgaaagt 
gtttactaat 
atctactgaa 
cataaatatt 
agaaaaggga 
cgaagatgac 
acccactaaa 
gacaaaaaat 
atttagaaat 
gtattccaat 
gtttaaaata 
ttatatgcct 
tagagtcata 
aaagtatgaa 
agcttgtcgt 
catgtgggat 
taaaattaaa 
agcaacaccc 
atgggaagca 
accgctacat 
ctgtaaagaa 
taagggaaaa 
atataaaaaa 
ccagttatac 
agacgattat 
atatgatact 
aacacaattt 
caattatgcc 
acagaagacg 
acttttgcac 
aaatgcagaa 
atgtatgcct 
aggaagtata 
aacatttcgt 
attaaaagat 
ttatagagat 
agaaaagcaa 
acgtgaaaag 



660 
720 

780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
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tggtrggacag atcacggacc tgagatatgg aaaggtatgt tatgcgccct aacaaatggt 4200 
ctcagtgaaa gcgaaaaaaa aacaaaaata ttcgacgact actcacacga caaagtcaac 4260 
caatccaaaa atggtaaccc ttccctcgag gatttcgcaa aaaaacctca atttttcaga 4320 
tggtttattg aatggagtga tgaattttgt cgggaaagga agaagaagga agaggaggtg 4380 
5 gaaagggatt gtaaggatga atatgaggga tgtgaaaagg agaaaaatgg taaatgtgtt 4440 
accgcatgta aagcatataa agaatacatt acaaacaaaa aagaagaata tgatagtcaa 4500 
aaagggaaat ttgacgttga aaaaacagag aagaaacaag gatatgaaga ttattctgag 4560 
aaacaggctt ctgaatattt gaaagaaaaa tgtataaaat catcatgtaa ttgtatgaag 4620 
aaagttacag aaatttccaa ttactggacc aaccctcata aaacctacga caccgaaaat 4680 

10 cttggaatca aatgtgaatg ccccccttca ccctgcacca tcgtggatgg catcctcagc 4740 
ccacaaaatt cgagttcgta cgccgaaggg tgtaaatgga aatatgggaa gatgtcacaa 4800 
gggggtacgg aatgggattg tagtaaaaaa agtgggggtg aaggtggtaa tgaggacggt 4860 
gatgttgtat gtatacctcc aaggagaagg agattatatg taaagaattt acaggatttg 4 920 
actggtgaag aatcactagt ggatttacga aaagctttta ttaagtgtgc tgctatagaa 4 980 

15 acattttttg cttggcatga atttaaaaaa gaaaaagaaa gagaggaaaa agaaaaaaat 5040 
gaacaagatg tacaatataa atcatctgtc ttagaaaatc ttcaaaagca gttaaaaaat 5100 
ggagaaatag atgatgagtt taaaagacaa atgttctata catttgcaga ttatagagat 5160 
atatgtttag ggaaggatat aggtaacgac gtggatggaa ttaatgaaaa aatagataca 5220 
attttgcaaa aaaatggaaa acctaataat atcgaagaat ataaaaaatg gtggcaaaaa 5280 

20 catggtcatg agatatggga aggaatgtta tgtgctctaa gctacaatac cgaaacaaaa 5340 
gagatggata aagaacttcg caacaaatta accgaacaaa agaacggtaa caaaaacacg 5400 
tacgacaccg tcacaattag tggtggtccc attggtaata ccaaattgga gaaatttgca 54 60 
tctaggcccc cattttttcg ttggttagaa gaatgggcag atgagttttg tagaaaacga 5520 
acacataaat tggaaaaaat tcaaaatgaa tgtaagggag taagtggtac aaatcagtgt 5580 

25 gatgatgatg gttttgactg cgatgaaatg tgtccaaaaa aggatgggag ttttgaaacg 5640 
tttaaatgtc tgagttgtgc caaatcttgt agattttata aaaagtggat aagtagaaaa 5700 
aaagaggaat ttgataaaca aagcaaaaaa tacgaaaacg aaattgacga tgttaaacat 5760 
aattctgata acatatatgg aaaagacttt cttgaaactc ttgatcaaca atataagtct 5820 
gttgaattat ttttagaaaa agtaaaagga ccatgttcta ttaataataa taatgaagaa 5880 

30 tgtaaaatag attttaataa accaaaggat acatttggtc atgcaaaaaa ttgtggtcca 5940 
tgttctgaaa ttagattcaa gtgtatagag gataatagca attgggttac tacaaataca 6000 
tgcaataaaa caacttttaa gtttacagaa gataataaag atacgaaaga agatagtgaa 6060 
caattaggta tgcttattag tgataataca gtacaaaatt ttgcagatgg tttacagaat 6120 
gattgtaaag atgcagatat ctttaaaggt cttagaaaag accaatggtc atgtggttat 6180 

35 ttttgtaatt tagatatatg tagtctgaaa acttctcatg gggaaaacaa ttataaacaa 6240 
aatatattaa ttagggcatt gtttaaacga tggttagaac attttttaga agactataat 6300 
aaaattaatg acaaaatttc acattgtatg aaaaatggtg aaggatccac atgtataaaa 6360 
ggatgtgaaa taaaatgtaa ttgtgtaagt aattggataa agaagaaaac gttagaatgg 6420 
gaaatagtac gagatcgttt ctttaaacaa tataatgttg attcagaaaa atcttttaca 6480 

40 gtgaaaagtt ttttagagca ggctccattt gacagtgatg ttcaaaaagc tataaaacct 6540 
tttgaaaaat tacgtgactt cgaggattca attgtatgta atggaactac gagcgcacga 6600 
aaggaaaaag gtacagaaaa ggatgtcgta atatgtttgc ttgataaact tcaaaaacaa 6660 
atagaaactt gtcaaacgaa acataaagaa acatctggaa atacatgttc cccaccccca 6720 
aaccccgaca cacaaacaga cacaccatta ccacttgagt cttttcctcc ccctttttgt 6780 

45 aacgttcctc ctaatccatg tggcgacaaa gatgcgacca acgtggttgg agttgaagtg 6840 
ctggcgaagg aaatgcagga ggcggcacat aaaagcatgt taagtcgtag tgctgttgat 6900 
agtggtaagg gtgataaggg tgagagtagt agtggtaaga gtagtttgga aggagatata 6960 
tccttagcag aatttaaaaa tggctttaat ccaagtgggt tgaagaacgt atgtcaaata 7020 
acggaaaaac attcctatgc taatggtgca tcaaaggatc cttgtaatgg aaaaggaaac 7080 

50 ggcaaggacc agagatttaa aatagaaacc caatggaaag atacaggcaa aagcggtaaa 7140 
cacgttgacg tctatttacc tccacgacgc gaacatatat gtacctcaaa tttggaatat 7200 
ttacttaagg gtaatagtga tcagattatg aaggttggaa ataacaaaat taatcattcc 7260 
tttttgggag aggtattgct agcagcaaaa tatgaagcag aattcataaa aaccaattat 7320 
acgagattaa atggccaaaa tgacaatgga gctaaatgta gagctatgaa atatagtttt 7380 

55 gctgatatag gagatattgt acgaggaaga gatctgtggg aacataatga ttttaaaaag 7440 
ctagaacgag atttggtaaa aatatttggt aaaattaaag agggaattac tgatgagaca 7500 
accaaaaaac aatatgaaaa ggatgacaca gacaataaac aattacgttg tgattggtgg 7560 
gaagctaacc gtgatcaagt atgggaagca atgcagtgta aaacaacaat accaccagtc 7620 
accacaagtt gtgatactac cactgttacc cctcttgtgg attacatccc ccaaagatta 7680 
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cgttggatga tggaatgggc 
ttgaggaagg ggtgtgagga 
gagaattgca caaaagcttg 
tggacaaaaa taaaagaaaa 
5 tctaatagtg gtacaacata 
ttacacgaaa aaaataaaga 
caagcaaaat atttagattg 
acattaccta gcggtagaga 
gaacgagcafc gtaaatgtca 

10 tcagaggacg atagagaacg 
gaagaaatag aacaagagga 
gaggaaccag catccgaagg 
ccaagagaaa atagtgatag 
gaacaaacac cgatactcaa 

15 ggagatcgtg gcgtaggacg 
gaatccgaaa acgaagacgt 
gacgaccccg aggcggagtc 
gcggtaccac cacaacccca 
ccccttaaaa ccgccctcat 

20 gccatcagtt actttttact 
attgatatcc ataaaggcga 
ccctatgtga gtgatacata 
ggagatgaaa aatatggatt 
tatgaagaat tggatattaa 

25 atagaagtag tattggaacc 
cctaccacga atacatttac 
caatatatac aaagtgaacc 
atgaatatag taggtaatgt 
attcatgata gggatttaaa 

30 aatagtatgg atgatccaaa 
aatgattcac taagtggtgg 
gaaaatgaat tatttggaac 
ttaacaaata gtgatccaat 
catagagata tgtgtgagat 

35 caatggaata aagataatga 
acggatgttt cgattgaaat 
atggatacta tcttggataa 
ccatctgtga atgatatacc 
gttgaaatga aaatccttaa 

40 tcggatgtat ggaatatata 



24/31 

agaatggtat tgtaaatatc 
ttgtaggagt tggaaatgta 
taaagactat aatagtaaaa 
atacgaagaa ttatacaaaa 
tcccaaagat gagaaagatg 
caataagata tattatactg 
tacacaacaa acacattttt 
caatgacaaa tatgctttta 
cgagaaacag gaaccaccac 
atcagaacct ggtgaagatg 
agaacccgaa gaaacttccg 
aggtggccca tcgggatcac 
ctccgacccc aaacctgacc 
acccgaagaa gaagcaccac 
ttctttagga ccaaccccac 
cgaagacgaa gatgacgaag 
ggaggaggaa gatgaagatc 
agcaccagca cctctacccc 
gtcttctacc atcatgtgga 
aaagaaaaaa ccgaaatcac 
ttatggaata cctacattgg 
caaaggcaaa acatatatat 
tatgtctgat actactgata 
tgatatatat gtaccaggta 
atcaaaaagt aatggtaaca 
agatgaggaa tggaatgaat 
actgaatgta ccacaatatg 
tttagatgat ggtataaacg 
tagtggagaa gaaattagtt 
atatgtatca aataatgtat 
taaacctatt gatatatatg 
aaattataag aaaaatacat 
tatgaaccaa ttagatttgt 
gtggaataat aaagaggaag 
tggtggtgat atatcaagtg 
agatatggat gatcctaaag 
tatagaagat gatatatatt 
tatggatcat aataaagtag 
taatacatcc actggatcct 
a 



aatcgaaggc atatagtgag 7740 
tgaagggtga tagtaaatgt 7800 
tagaaccatg gaaacagcaa 7860 
aagcacaaaa tagtgatacc 7920 
tcgtttcttt tttgtcaaaa 7980 
ctgcaggata tatacatcaa 8040 
gtgataaaaa aaatggcgag 8100 
agaaaccgcc aaaaaaatat 8160 
ctcctaaggt accagaagat 8220 
cactccctgt actcccacca 8280 
tagacactac acaagatgag 8340 
caacagaaga aagtggggaa 8400 
aaaaccccga agccaacccc 84 60 
caaaatcaaa accacccgat 8520 
gttctgaagt tgaacccgag 8580 
aggaggaaga agaccccgac 8640 
acggcggcca ggaggcggag 8700 
ctcctccacc acctttaccc 8760 
gtgtaggtat cggttttgcg 8820 
ctgttgacct catacgtgta 8880 
aatccaaaaa tagatatatc 8940 
atatggaagg agatactagt 9000 
taacttcctc cgaaagtgag 9060 
gtcctaaata taaaacattg 9120 
cactaggtga tgatatggta 9180 
tgaaacacga ttttatatca 9240 
atgtattaaa ggagttacca 9300 
aaaaaccttt tattacttct 9360 
ataatattaa tatgagtact 9420 
attctggtat agatttaatt 9480 
atgaagtgct aaaaagaaaa 9540 
caaataacaa tgtagctaaa 9600 
tacataaatg gttagataga 9660 
tattagataa attaaaagaa 9720 
atagtaacaa aaggttgaat 9780 
gaaagaagga atttagtaat 9840 
atgatgtaaa tgatgaaaac 9900 
atgtacctaa gaaagtacat 9960 
tggaacaaca atttcctata 10020 

10041 



<210> 6 

45 <211> 3346 

<212> PRT 

<213> Plasmodium falciparum 
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Phe 


Glu 


Lys 


He 


Asn 


Asn 


Gly 


Leu 


Lys 


Lys 


He 


Gly 


He 


Asn 


Asp 


Tyr 


20 


225 










230 










235 










240 




Asn 


Asp 


He 


Ser 


Gly 


Asn 


Tyr 


Tyr 


Lys 


Leu 


Arg 


Glu 


Ala 


Trp 


Trp 


Thr 












245 










250 










255 






Ala 


Asn 


Arg 


Asp 


Gin 


Val 


Trp 


Lys 


Ala 


He 


Thr 


Cys 


Arg 


Ala 


Pro 


Asn 










260 










265 










270 






ZD 


Gly 


Ala 


Asn 


Tyr 


Phe 


Arg 


Lys 


Gly 


Leu 


Asp 


Gly 


Lys 


He 


He 


Phe 


Ser 








275 










280 










285 










Asp 


Asn 


Gly 


Pro 


Cys 


Gly Arg 


Lys 


Glu 


Leu 


He 


Val 


Pro 


Thr 


Tyr 


Leu 






290 










295 










300 












Asp 


Tyr 


Val 


Pro 


Gin 


Phe 


Leu 


Arg 


Trp 


Leu 


Asn 


Glu 


Trp 


Ser 


Glu 


Glu 


30 


305 










310 










315 










320 




Phe 


Cys 


Arg 


lie 


Lys 


Asn 


He 


Lys 


He 


Gly 


Asn 


He 


Lys 


Lys 


Ser Cys 












325 










330 










335 






Thr 


Gly 


Glu 


Ser 


Asn 


Asn 


Lys 


His 


Cys 


Ser 


Arg 


Glu 


Gly 


Tyr 


Asp 


Cys 










340 










345 










350 








Asn 


Lys 


Thr 


Asn 


Leu 


Arg 


Leu 


Asn 


Glu 


He 


Phe 


Met 


Asp 


Leu 


Glu 


Cys 








355 










360 










365 










Pro 


Arg 


Cys 


Ala 


Asp 


Asp Cys 


Lys 


Ser 


Tyr 


Glu 


Thr 


Trp 


Val 


Glu 


Lys 






370 










375 










380 












Lys 


Lys 


Lys 


Glu 


Phe 


Asn 


Lys 


Gin 


Lys 


Lys 


Lys 


Tyr 


Glu 


Lys 


Glu 


Val 


40 


385 










390 










395 










400 




Asp 


Ala 


Thr 


Gin 


Asn 


Asn 


Asp 


Asn 


Asn 


Glu 


Asn 


Gly 


He 


Tyr 


Asn 


Lys 












405 










4 10 










415 






i-ys 


Phe 


Tyr 


Asp 


Glu 


Leu 


Lys 


Ser 


Ser 


Tyr 


Lys 


Glu 


Val 


Asn 


Ser 


Phe 










4 20 










425 










430 






yl IT 


Phe 


Glu 


Leu 


Leu 


Asn 


Lys 


Gly 


Pro 


He 


Cys 


Glu 


His 


He 


Asp 


Lys 


Lys 








435 










440 










445 










He 


Pro 


Met 


Asp 


Tyr 


Asn 


Asn 


Thr 


Glu 


Lys 


Thr 


Phe 


Ser 


Arg 


Ser 


Glu 






450 










455 










4 60 












Tyr 


Cys 


Lys 


Ser 


Cys 


Pro 


He 


Thr 


Asp 


He 


Leu 


Cys 


Asp 


Asp 


Asn 


Glu 




4 65 










470 










4 75 










480 




Cys 


7 Tie 


Thr 


T 1 *a 


Asn 


Glu 


Phe 


Lys 


Cys 








Lys 


Ser 


Met 


Pro 












485 










490 










495 






Asn 


lie 


Arg 


Lys 


Asn 


Glu 


Asn 


Glu 


Thr 


Pro 


He 


Asp 


He 


Asp 


He 


Leu 










500 










505 










510 






55 


Val 


Asn 


Val 


Asn 


Asn 


Lys 


Lys 


Val 


He 


Thr 


His 


Asp 


Leu 


Lys 


Asn 


Asn 








515 










520. 










525 










Tyr 


Glu 


Asn 


Cys 


Asp 


Leu 


Phe 


Lys 


Lys 


Leu 


Gly 


Glu 


Gin 


Lys 


Trp 


Lys 






530 










535 










540 












Cys 


Lys 


Tyr 


Lys 


Cys 


Tyr 


Leu 


Asp 


Val 


Cys 


Glu 


Pro 


Arg 


Asn 


Leu Asp 
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545 550 555 560 

Ser Asn lie Tyr Asn Glu Arg Tyr lie Ser lie Lys Val Leu Phe Lys 

565 570 575 

Arg Trp Leu Glu Tyr Phe Leu Glu Asp Tyr Asn Lys Leu Lys Glu Lys 
5 580 585 590 

Leu Asn Pro Cys Met Tyr Asn Val Gin Glu lie Val Cys lie Asn Glu 

595 600 605 

Cys Lys Gin Asn Cys Glu Cys Val Glu Lys Trp lie Lys Glu Lys Arg 
610 615 620 

10 Glu Glu Trp Lys Lys He Lys Asp Arg Tyr Val Gin Gin Tyr Glu Ser 
625 630 635 640 

Lys Asp Glu Asp Val Ser Ser Lys Leu Lys Lys Phe Leu Lys Gin Glu 

645 650 655 

Leu Phe Thr Asn Tyr Val Lys Asn Ala Leu Asp Lys Asp Glu Thr Leu 
15 660 665 670 

Asp Ser Met Lys Glu Ser Thr Glu Cys He Asp Pro Asn Lys Pro Lys 

675 680 685 

Gly Lys Pro Cys Asn Asn Asn Asp Val He Asn He Leu Leu Asn Arg 
690 695 700 

20 Leu Glu Lys Gin He Asp Asn Cys Lys Lys Lys His Glu Glu Lys Gly 
705 710 715 720 

Glu Lys Pro Cys Val Asp He Pro Lys Leu Leu Asn Asp Glu Asp Glu 

725 730 735 

Asp Glu Asp Asp Asp Glu Thr Pro Arg Ala His Asn Pro Cys Val Asp 
25 740 745 750 

Lys Asn Asp Ser Gin Pro Thr Lys Thr Val Ser Tyr He Ala Arg Gin 

755 760 765 

Met His Arg Arg Ala Lys Ala Gin Met Thr Lys Asn Ser Val Val Asp 
770 775 780 

30 Gly Asp Asn Lys Leu Glu Gly Asp He Phe Lys Val Thr Phe Arg Asn 
785 790 795 800 

Gly Gly Val Gly Lys Asn Leu Asn Gly Asp He Cys Lys He Asp Lys 

805 810 815 

Thr Tyr Ser Asn Asp Ser Arg Gly Thr Pro Thr Asp Gly Pro Cys Glu 
35 820 825 830 

Gly Lys Gly Asp Arg Phe Lys He Gly Thr Asp Trp Gin Gly Asp Ser 

835 840 845 

Phe Val Asn Pro Gin Tyr Arg Gly He Tyr Met Pro Pro Arg Arg Gin 
850 855 860 

40 His Phe Cys Thr Ser Asn Leu Glu Lys Leu Asp Val Ser Arg Val He 
865 870 875 880 

Arg Asn Gly Asn Ala Ser Asn Ser Leu Leu Gly Asp Val Leu Leu Ala 

885 890 895 

Ala Lys Tyr Glu Ala Glu Arg Thr Lys Asn His Tyr Val Ser Lys Lys 
45 900 905 910 

Glu Glu His Ser Glu Ala Cys Arg Ala Val Arg Tyr Ser Phe Ala Asp 

915 920 925 

Leu Gly Asp He He Arg Gly Lys Asp Met Trp Asp Lys Asn His Gly 
930 935 940 

50 Glu Lys Lys Thr Gin Glu Asn Leu Glu Arg He Phe Ala Lys He Lys 
945 950 955 960 

Glu Gin Leu Leu Asn Ser Ser He Lys Asp Lys Tyr Lys Asp Asp Asp 

965 970 975 

Lys Ala Thr Pro Lys Tyr Lys Gin Leu Arg Glu Asp Trp Trp Glu Ala 
55 980 985 990 

Asn Arg Ser Gin Val Trp Glu Ala Met Gin Cys Pro Pro Lys Asn Gly 

995 1000 1005 

Thr Phe Pro Cys Lys Ser Asp His Thr Pro Leu His Asp Tyr He Pro 
1010 1015 1020 
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Gin Arq Leu Arq Trp Met Thr Glu Trp Ala Glu Trp Tyr Cys Lys Glu 
1025 1030 1035 1040 

Gin Ser Arq Leu Tyr Gly Glu Leu Val Glu Thr Cys Gly Lys Cys Met 
1045 1050 1055 

5 His Lys Gly Lys Cys Lys Gin Gly Asn Gly His Cys Val Thr Cys Lys 
1060 1065 1070 

Pro Ala Cys Glu Lys Tyr Lys Lys Phe He Asn Thr Trp Gin Pro Gin 

1075 1080 1085 

Trp Lys Gin Met Glu Gin Lys Tyr Ser Gin Leu Tyr Glu Glu Ala Lys 
10 1090 1095 1100 

Lvs Tyr Asn Asp Ser Ser Arg Lys Asp Thr Thr Asn Lys Asp Asp Tyr 
li05 1110 1115 1120 

Val Leu Gin Phe Leu Asn Lys Leu Leu Thr Gin Asn Lys Gly Asn Lys 
1125 1130 1135 

15 Thr Tyr Asp Thr Ala Glu Gly Tyr Val His Gin Glu Ala His He Ser 
1140 1145 1150 

Asp Cys Gin Lys Gin Thr Gin Phe Cys Lys Lys Arg Asn Gly Glu He 

1155 1160 1165 

Pro Ser Ser Asp Thr Glu Thr Asp Asn Asn Tyr Ala Phe Arg Pro Gin 
20 1170 1175 1180 

Pro His Asp His Asp Glu Val Cys Glu Cys Asn Thr Arg Gin Lys Thr 
1185 1190 1195 1200 

Lvs Val Arg Lys Lys Lys Lys Lys Val Asp Ala Cys Glu Met Ala Lys 
1205 1210 1215 

25 Thr Leu Leu His Asn Asn Asp Gly Thr He Arg He Gly Gin Cys Lys 
1220 1225 1230 

Arg Lys Asp Glu Gly Asn Ala Glu Tyr Pro Lys Trp Asp Cys Asn Ser 

1235 1240 1245 

Gin He His Thr Thr His Asn Gly Ala Cys Met Pro Pro Arg Arg Gin 
30 1250 1255 1260 

Lvs Leu Cys Val Tyr Phe Phe Ala Asn Pro Ser Gin He Gly Ser He 
3265 1270 1275 1280 

Asn Lys Gin Asp Asn Leu Arg Lys Ala Phe He He Ser Ala Ala Ala 
1285 1290 1295 

35 Glu Thr Phe Arg Ser Trp Gin Tyr Tyr Lys Ser Lys Asn Gly Gly Glu 
1300 1305 1310 

Asn Leu Gin Thr Gin Leu Lys Asp Gly Thr He Pro Asp Asp Phe Lys 

1315 1320 1325 

Arg Gin Met Phe Tyr Thr Tyr Gly Asp Tyr Arg Asp Phe Leu Phe Gly 
40 1330 1335 1340 

Thr Asp He Ser Lys Gly Leu Gly Glu Gly Thr Ala Leu Glu Lys Gin 
1345 1350 1355 1360 

He Asn He Leu Phe Pro Asn Gly Val Arg Lys He Pro Asn Glu Lys 
1355 1370 1375 

45 Thr Arq Glu Lys Trp Trp Thr Asp His Gly Pro Glu He Trp Lys Gly 
13B0 1385 1390 

Met Leu Cys Ala Leu Thr Asn Gly Leu Ser Glu Ser Glu Lys Lys Thr 

1395 1400 1^05 

Lys He Phe Asp Asp Tyr Ser His Asp Lys Val Asn Gin Ser Lys Asn 
50 1410 1415 1420 

Gly Asn Pro Ser Leu Glu Asp Phe Ala Lys Lys Pro Gin Phe Phe Arg 
1425 1430 1435 1440 

Trp Phe He Glu Trp Ser Asp Glu Phe Cys Arg Glu Arg Lys Lys Lys 
1445 1450 1455 

55 Glu Glu Glu Val Glu Arg Asp Cys Lys Asp Glu Tyr Glu Gly Cys Glu 
1460 1465 1470 

Lys Glu Lys Asn Gly Lys Cys Val Thr Ala Cys Lys Ala Tyr Lys Glu 

1475 1480 1485 

Tyr He Thr Asn Lys Lys Glu Glu Tyr Asp Ser Gin Lys Gly Lys Phe 
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1490 1495 1500 

Asp Val Glu Lys Thr Glu Lys Lys Gin Gly Tyr Glu Asp Tyr Ser Glu 
1505 1510 1515 1520 

Lys Gin Ala Ser Glu Tyr Leu Lys Glu Lys Cys lie Lys Ser Ser Cys 
5 1525 1530 1535 

Asn Cys Met Lys Lys Val Thr Glu He Ser Asn Tyr Trp Thr Asn Pro 

1540 1545 1550 

His Lys Thr Tyr Asp Thr Glu Asn Leu Gly He Lys Cys Glu Cys Pro 
1555 1560 1565 

10 Pro Ser Pro Cys Thr He Val Asp Gly He Leu Ser Pro Gin Asn Ser 
1570 1575 1580 

Ser Ser Tyr Ala Glu Gly Cys Lys Trp Lys Tyr Gly Lys Met Ser Gin 
1585 1590 1595 1600 

Gly Gly Thr Glu Trp Asp Cys Ser Lys Lys Ser Gly Gly Glu Gly Gly 
15 1605 1610 1615 

Asn Glu Asp Gly Asp Val Val Cys He Pro Pro Arg Arg Arg Arg Leu 

1620 1625 1630 

Tyr Val Lys Asn Leu Gin Asp Leu Thr Gly Glu Glu Ser Leu Val Asp 
1635 1640 1645 

20 Leu Arg Lys Ala Phe He Lys Cys Ala Ala He Glu Thr Phe Phe Ala 
1650 1655 1660 

Trp His Glu Phe Lys Lys Glu Lys Glu Arg Glu Glu Lys Glu Lys Asn 
1665 1670 1675 1680 

Glu Gin Asp Val Gin Tyr Lys Ser Ser Val Leu Glu Asn Leu Gin Lys 
25 1685 1690 1695 

Gin Leu Lys Asn Gly Glu He Asp Asp Glu Phe Lys Arg Gin Met Phe 

1700 1705 1710 

Tyr Thr Phe Ala Asp Tyr Arg Asp He Cys Leu Gly Lys Asp He Gly 
1715 1720 1725 

30 Asn Asp Val Asp Gly He Asn Glu Lys He Asp Thr He Leu Gin Lys 
1730 1735 1740 

Asn Gly Lys Pro Asn Asn He Glu Glu Tyr Lys Lys Trp Trp Gin Lys 
1745 1750 1755 1760 

His Gly His Glu He Trp Glu Gly Met Leu Cys Ala Leu Ser Tyr Asn 
35 1765 1770 1775 

Thr Glu Thr Lys Glu Met Asp Lys Glu Leu Arg Asn Lys Leu Thr Glu 

1780 1785 1790 

Gin Lys Asn Gly Asn Lys Asn Thr Tyr Asp Thr Val Thr He Ser Gly 
1795 IBOO 1805 

40 Gly Pro He Gly Asn Thr Lys Leu Glu Lys Phe Ala Ser Arg Pro Pro 
1810 1815 1820 

Phe Phe Arg Trp Leu Glu Glu Trp Ala Asp Glu Phe Cys Arg Lys Arg 
1825 1830 1835 1840 

Thr His Lys Leu Glu Lys He Gin Asn Glu Cys Lys Gly Val Ser Gly 
45 1845 1850 1855 

Thr Asn Gin Cys Asp Asp Asp Gly Phe Asp Cys Asp Glu Met Cys Pro 

1860 1865 1870 

Lys Lys Asp Gly Ser Phe Glu Thr Phe Lys Cys Leu Ser Cys Ala Lys 
1875 1880 1885 

50 Ser Cys Arg Phe Tyr Lys Lys Trp He Ser Arg Lys Lys Glu Glu Phe 
1890 1895 1300 

Asp Lys Gin Ser Lys Lys Tyr Glu Asn Glu He Asp Asp Val Lys His 
1905 1910 1915 1920 

Asn Ser Asp Asn He Tyr Gly Lys Asp Phe Leu Glu Thr Leu Asp Gin 
55 1925 1930 1935 

Gin Tyr Lys Ser Val Glu Leu Phe Leu Glu Lys Val Lys Gly Pro Cys 

1940 1945 1950 

Ser He Asn Asn Asn Asn Glu Glu Cys Lys He Asp Phe Asn Lys Pro 
1955 I960 1365 
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Lys Asp Thr Phe Gly His Ala Lys Asn Cys Gly Pro Cys Ser Glu lie 

1970 1975 1980 

Arg Phe Lys Cys lie Glu Asp Asn Ser Asn Trp Val Thr Thr Asn Thr 
1985 1990 1995 2000 

5 Cys Asn Lys Thr Thr Phe Lys Phe Thr Glu Asp Asn Lys Asp Thr Lys 

2005 2010 2015 

Glu Asp Ser Glu Gin Leu Gly Met Leu lie Ser Asp Asn Thr Val Gin 

2020 2025 2030 

Asn Phe Ala Asp Gly Leu Gin Asn Asp Cys Lys Asp Ala Asp lie Phe 
10 2035 2040 2045 

Lys Gly Leu Arg Lys Asp Gin Trp Ser Cys Gly Tyr Phe Cys Asn Leu 

2050 2055 2060 

Asp lie Cys Ser Leu Lys Thr Ser His Gly Glu Asn Asn Tyr Lys Gin 
2065 2070 2075 2080 

15 Asn lie Leu lie Arg Ala Leu Phe Lys Arg Trp Leu Glu His Phe Leu 

2085 2090 2095 

Glu Asp Tyr Asn Lys lie Asn Asp Lys lie Ser His Cys Met Lys Asn 

2100 2105 2110 

Gly Glu Gly Ser Thr Cys lie Lys Gly Cys Glu lie Lys Cys Asn Cys 
20 2115 2120 2125 

Val Ser Asn Trp lie Lys Lys Lys Thr Leu Glu Trp Glu lie Val Arg 

2130 2135 2140 

Asp Arg Phe Phe Lys Gin Tyr Asn Val Asp Ser Glu Lys Ser Phe Thr 
2145 2150 2155 2160 

25 Val Lys Ser Phe Leu Glu Gin Ala Pro Phe Asp Ser Asp Val Gin Lys 

2165 2170 2175 

Ala He Lys Pro Phe Glu Lys Leu Arg Asp Phe Glu Asp Ser He Val 

2180 2185 2190 

Cys Asn Gly Thr Thr Ser Ala Arg Lys Glu Lys Gly Thr Glu Lys Asp 
30 2195 2200 2205 

Val Val He Cys Leu Leu Asp Lys Leu Gin Lys Gin He Glu Thr Cys 

2210 2215 2220 

Gin Thr Lys His Lys Glu Thr Ser Gly Asn Thr Cys Ser Pro Pro Pro 
2225 2230 2235 2240 

35 Asn Pro Asp Thr Gin Thr Asp Thr Pro Leu Pro Leu Glu Ser Phe Pro 

2245 2250 2255 

Pro Pro Phe Cys Asn Val Pro Pro Asn Pro Cys Gly Asp Lys Asp Ala 

2260 2265 2270 

Thr Asn Val Val Gly Val Glu Val Leu Ala Lys Glu Met Gin Glu Ala 
40 2275 2280 2285 

Ala His Lys Ser Met Leu Ser Arg Ser Ala Val Asp Ser Gly Lys Gly 

2290 2295 2300 

Asp Lys Gly Glu Ser Ser Ser Gly Lys Ser Ser Leu Glu Gly Asp He 
2305 2310 2315 2320 

45 Ser Leu Ala Glu Phe Lys Asn Gly Phe Asn Pro Ser Gly Leu Lys Asn 

2325 2330 2335 

Val Cys Gin He Thr Glu Lys His Ser Tyr Ala Asn Gly Ala Ser Lys 

2340 2345 2350 

Asp Pro Cys Asn Gly Lys Gly Asn Gly Lys Asp Gin Arg Phe Lys He 
50 2355 2360 2365 

Glu Thr Gin Trp Lys Asp Thr Gly Lys Ser Gly Lys His Val Asp Val 

2370 2375 2380 

Tyr Leu Pro Pro Arg Arg Glu His He Cys Thr Ser Asn Leu Glu Tyr 
2385 2390 2395 2400 

55 Leu Leu Lys Gly Asn Ser Asp Gin He Met Lys Val Gly Asn Asn Lys 

2405 2410 2415 

He Asn His Ser Phe Leu Gly Glu Val Leu Leu Ala Ala Lys Tyr Glu 

2420 2425 2430 

Ala Glu Phe He Lys Thr Asn Tyr Thr Arg Leu Asn Gly Gin Asn Asp 
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2435 2440 2445 

Asn Gly Ala Lys Cys Arg Ala Met Lys Tyr Ser Phe Ala Asp He Gly 

2450 2455 2460 

Asp He Val Arg Gly Arg Asp Leu Trp Glu His Asn Asp Phe Lys Lys 
5 2465 2470 2475 2480 

Leu Glu Arg Asp Leu Val Lys He Phe Gly Lys He Lys Glu Gly He 

2485 2490 2495 

Thr Asp Glu Thr Thr Lys Lys Gin Tyr Glu Lys Asp Asp Thr Asp Asn 
2500 2505 2510 

10 Lys Gin Leu Arg Cys Asp Trp Trp Glu Ala Asn Arg Asp Gin Val Trp 
2515 2520 2525 

Glu Ala Met Gin Cys Lys Thr Thr He Pro Pro Val Thr Thr Ser Cys 

2530 2535 2540 

Asp Thr Thr Thr Val Thr Pro Leu Val Asp Tyr He Pro Gin Arg Leu 
15 2545 2550 2555 2560 

Arg Trp Met Met Glu Trp Ala Glu Trp Tyr Cys Lys Tyr Gin Ser Lys 

2565 2570 2575 

Ala Tyr Ser Glu Leu Arg Lys Gly Cys Glu Asp Cys Arg Ser Trp Lys 
2580 2585 2590 

20 Cys Met Lys Gly Asp Ser Lys Cys Glu Asn Cys Thr Lys Ala Cys Lys 
2595 2600 2605 

Asp Tyr Asn Ser Lys He Glu Pro Trp Lys Gin Gin Trp Thr Lys He 

2610 2615 2620 

Lys Glu Lys Tyr Glu Glu Leu Tyr Lys Lys Ala Gin Asn Ser Asp Thr 
25 2625 2630 2635 2640 

Ser Asn Ser Gly Thr Thr Tyr Pro Lys Asp Glu Lys Asp Val Val Ser 

2645 2650 2655 

Phe Leu Ser Lys Leu His Glu Lys Asn Lys Asp Asn Lys He Tyr Tyr 
2660 2665 2670 

30 Thr Ala Ala Gly Tyr He His Gin Gin Ala Lys Tyr Leu Asp Cys Thr 
2675 2680 2685 

Gin Gin Thr His Phe Cys Asp Lys Lys Asn Gly Glu Thr Leu Pro Ser 

2690 2695 2700 

Gly Arg Asp Asn Asp Lys Tyr Ala Phe Lys Lys Pro Pro Lys Lys Tyr 
35 2705 2710 2715 2720 

Glu Arg Ala Cys Lys Cys His Glu Lys Gin Glu Pro Pro Pro Pro Lys 

2725 2730 2735 

Val Pro Glu Asp Ser Glu Asp Asp Arg Glu Arg Ser Glu Pro Gly Glu 
2740 2745 2750 

40 Asp Ala Leu Pro Val Leu Pro Pro Glu Glu He Glu Gin Glu Glu Glu 
2755 2760 2765 

Pro Glu Glu Thr Ser Val Asp Thr Thr Gin Asp Glu Glu Glu Pro- Ala 

2770 2775 2780 

Ser Glu Gly Gly Gly Pro Ser Gly Ser Pro Thr Glu Glu Ser Gly Glu 
45 2785 2790 2795 2800 

Pro Arg Glu Asn Ser Asp Ser Ser Asp Pro Lys Pro Asp Gin Asn Pro 

2805 2810 2815 

Glu Ala Asn Pro Glu Gin Thr Pro He Leu Lys Pro Glu Glu Glu Ala 
2820 2825 2830 

50 Pro Pro Lys Ser Lys Pro Pro Asp Gly Asp Arg Gly Val Gly Arg Ser 
2835 2840 2845 

Leu Gly Pro Thr Pro Arg Ser Glu Val Glu Pro Glu Glu Ser Glu Asn 

2850 2855 2860 

Glu Asp Val Glu Asp Glu Asp Asp Glu Glu Glu Glu Glu Asp Pro Asp 
55 2865 2870 2875 2880 

Asp Asp Pro Glu Ala Glu Ser Glu Glu Glu Asp Glu Asp His Gly Gly 

2885 2890 2895 

Gin Glu Ala Glu Ala Val Pro Pro Gin Pro Gin Ala Pro Ala Pro Leu 
2900 2905 2910 
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Pro Pro Pro Pro Pro Pro Leu Pro Pro Leu Lys Thr Ala Leu Met Ser 

2915 2920 2925 

Ser Thr lie Met Trp Ser Val Gly lie Gly Phe Ala Ala lie Ser Tyr 
2930 2935 29410 

5 Phe Leu Leu Lys Lys Lys Pro Lys Ser Pro Val Asp Leu lie Arg Val 
2945 2950 2955 2960 

lie Asp lie His Lys Gly Asp Tyr Gly He Pro Thr Leu Glu Ser Lys 

2965 2970 2975 

Asn Arg Tyr lie Pro Tyr Val Ser Asp Thr Tyr Lys Gly Lys Thr Tyr 
10 2980 2985 2990 

He Tyr Met Glu Gly Asp Thr Ser Gly Asp Glu Lys Tyr Gly Phe Met 

2995 3000 3005 

Ser Asp Thr Thr Asp He Thr Ser Ser Glu Ser Glu Tyr Glu Glu Leu 
3010 3015 3020 

15 Asp He Asn Asp He Tyr Val Pro Gly Ser Pro Lys Tyr Lys Thr Leu 
3025 3030 3035 3040 

He Glu Val Val Leu Glu Pro Ser Lys Ser Asn Gly Asn Thr Leu Gly 

3045 3050 3055 

Asp Asp Met Val Pro Thr Thr Asn Thr Phe Thr Asp Glu Glu Trp Asn 
20 3060 3065 3070 

Glu Leu Lys His Asp Phe He Ser Gin Tyr He Gin Ser Glu Pro Leu 

3075 3080 3085 

Asn Val Pro Gin Tyr Asp. Val Leu Lys Glu Leu Pro Met Asn He Val 
3090 3095 3100 

25 Gly Asn Val Leu Asp Asp Gly He Asn Glu Lys Pro Phe He Thr Ser 
3105 3110 3115 3120 

He His Asp Arg Asp Leu Asn Ser Gly Glu Glu He Ser Tyr Asn He 

3125 3130 3135 

Asn Met Ser Thr Asn Ser Met Asp Asp Pro Lys Tyr Val Ser Asn Asn 
30 3140 3145 3150 

Val Tyr Ser Gly He Asp Leu He Asn Asp Ser Leu Ser Gly Gly Lys 

3155 3160 3165 

Pro He Asp He Tyr Asp Glu Val Leu Lys Arg Lys Glu Asn Glu Leu 
3170 3175 3180 

35 Phe Gly Thr Asn Tyr Lys Lys Asn Thr Ser Asn Asn Asn Val Ala Lys 
3185 3190 3195 3200 

Leu Thr Asn Ser Asp Pro He Met Asn Gin Leu Asp Leu Leu His Lys 

3205 3210 3215 

Trp Leu Asp Arg His Arg Asp Met Cys Glu Met Trp Asn Asn Lys Glu 
40 3220 3225 3230 

Glu Val Leu Asp Lys Leu Lys Glu Gin Trp Asn Lys Asp Asn Asp Gly 

3235 3240 3245 

Gly Asp He Ser Ser Asp Ser Asn Lys Arg Leu Asn Thr Asp Val Ser 
3250 3255 3260 

45 He Glu He Asp Met Asp Asp Pro Lys Gly Lys Lys Glu Phe Ser Asn 
3265 3270 3275 3280 

Met Asp Thr He Leu Asp Asn He Glu Asp Asp He Tyr Tyr Asp Val 

3285 3290 3295 

Asn Asp Glu Asn Pro Ser Val Asn Asp He Pro Met Asp His Asn Lys 
50 3300 3305 3310 

Val Asp Val Pro Lys Lys Val His Val Glu Met Lys He Leu Asn Asn 

3315 3320 3325 

Thr Ser Thr Gly Ser Leu Glu Gin Gin Phe Pro He Ser Asp Val Trp 
3330 3335 3340 

55 Asn He 
3345 



